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(5?) Recombinant DNA polymerases from 
archaebacteria as well as isolated DNA coding 
for such polymerases are provided. The isolated 
DNA is obtained by use of DNA or antibody 
probes prepared from the DNA encoding 7". 
litoraiis DNA polymerase and the T. iitoralis DNA 
polymerase respectively. Also provided are 
methods for producing recombinant 
archaebacteria thermostable DNA polymerase 
and methods for enhancing the expression of 
such polymerases by identifying, locating and 
removing introns from within the DNA coding 
for such DNA polymerases. 
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FIELD OF THE INVENTION 

The present invention relates to recombinant DNA polymerases from archaebacterium, to isolated DNA 
coding for said DNA polymerases which hybridizes to DNA probes prepared from the DNA sequence coding 
5 for T. titoralis DNA polymerase, to DNA and antibody probes employed in the isolation of said DNA, as well as 
to related methods for isolating said DNA and methods of identifying, locating and removing intervening nu- 
cleotide sequences within said DNA in order to enhance expression of said DNA polymerases 

BACKGROUND OF THE INVENTION 

10 

DNA polymerases are a family of enzymes involved in DNA repair and replication. Extensive research has 
been conducted on the isolation of DNA polymerases from mesophilic microorganisms such as E. coli. See, 
for example, Bessman, etal., J. Biol. Chem. (1957) 233:171-177 and Buttin and Kornberg J. Biol. Chem.. (1966) 
241:5419-5427. 

15 Examples of DNA polymerases isolated from E. Coli include E. coli DNA polymerase I, Klenow fragment 

of E. coli DNA polymerase I and T4 DNA polymerase. These enzymes have a variety of uses in recombinant 
DNA technology including, for example, labelling of DNA by nick translation, second-strand cDNA synthesis in 
cDNA cloning, and DNA sequencing. See Maniatis, et al., Molecular Cloning: A Laboratory Manual (1982). 
Recently, U.S. Patent Nos. 4,683,195, 4,683,202 and 4,800,159 disclosed the use of the above enzymes 

20 in a process for amplifying, detecting, and/or cloning nucleic acid sequences. This process, commonly referred 
to as polymerase chain reaction (PCR), involves the use of a polymerase, primers and nucleotide triphosphates 
to amplify existing nucleic acid sequences. 

Some of the DNA polymerases discussed above possess a 3*-5' exonuclease activity which provides a 
proofreading function that gives DNA replication much higher fidelity that it would have if synthetis were the 

25 result of only a one base-pairing selection step. Brutlag, D. and Kornberg, A. t J. Biol. Chem., (1972) 247:241- 
248. DNA polymerases with 3'-5' proofreading exonuclease activity have a substantially lower base incorpor- 
ation error rate when compared with a non-proofreading exonuclease-possessing polymerase. Chang, L.M.S., 
J. Biol. C/ie/n., (1 977) 252: 1 873-1 880. 

Research has also been conducted on the isolation and purification of DNA polymerases from thenmo- 

30 philes, such as Thermus aquaticus. Chien, A, et al. J. Bacteriol.. (1 976) 1 27:1 550-1557, discloses the isolation 
and purification of a DNA polymerase with a temperature optimum of 80°C from T. aquaticus YT1 strain. The 
Chien, et al., purification procedure involves a four-step process. These steps involve preparation of crude ex- 
tract, DEAE-Sephadex chromatography, phosphocellulose chromatography, and chromatography on DNA cel- 
lulose. Kaledin, et al., Biokhymiyay (1980) 45:644-651 also discloses the isolation and purification of a DNA 

35 polymerase from cells of T. aquaticus YT1 strain. The Kaledin, et al. purification procedure involves a six-step 
process. These steps involve isolation of crude extract, ammonium sulfate precipitation, DEAE-cellulose chro- 
matography, fractionation on hydroxyapatite, fractionation on DEAE-cellulose, and chromatography on single- 
strand DNA-celluiose. 

United States Patent No. 4,689,818 discloses a purified thermostable DNA polymerase from 7! aquaticus, 

w Taq polymerase, having a molecular weight of about 86,000 to 90,000 daltons prepared by a process su bstan- 
tially identical to the process of Kaledin with the addition of the substitution of a phosphocellulose chromatog- 
raphy step in lieu of chromatography on single-strand DNA-cellulose. In addition, European Patent Application 
0258017 discloses Taq polymerase as the preferred enzyme for use in the PCR process discussed above. 
Research has indicated that while the Taq DNA polymerase has a 5-3' polymerase-dependent exonu- 

45 ' clease function, the Taq DNA polymerase does not possess a 3*-5' proofreading exonuclease function. Lawyer, 
F.C., et al. J. Bioi.Chem., (1989) 264:11, p. 6427-6437. Bernard, A., et al. Ce//(1989) 59:219. As a result, Taq 
DNA polymerase is prone to base incorporation errors, making its use in certain applications undesirable. For 
example, attempting to clone an amplified gene is problematic since any one copy of the gene may contain an 
error due to a random misincorporation event. Depending on where in the replication cycle that error occurs 

50 (e.g., in an early replication cycle), the entire DNA amplified could contain the erroneously incorporated base, 
thus, giving rise to mutated gene product. Furthermore, research has indicated that Taq DNA polymerase has 
a thermal stability of not more than several minutes at 100°C. 

Accordingly, other DNA polymerases with comparable or improved thermal stability and/or 3* to 5' exonu- 
clease proofreading activity would be desirable for the scientific community. One such enzyme (described in 

55 more detail below), DNA polymerase from Thermococcus litoratis, an archaebacterium that grows at temper- 
atures close to 100°C near submarine thermal vents, has been cloned into E. coli. The production of large 
amounts of this recombinant enzyme protein from this gene is complicated, however, by the presence of two 
introns, one of which must be removed by genetic engineering techniques, and the other which encodes an 
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endo nuclease which is s pi iced out in E. coli. 

It would be desirable to obtain and produce other highly thermostable DNA polymerases from archaebac- 
terium which have a 3' to 5' proofreading activity and/or comparable or improved thermal stability so as to im- 
prove the DNA polymerase processes described above. 

5 

SUMMARY OF THE INVENTION 

In accordance with the present invention, there is provided methods and products for identifying, isolating 
and cloning DNA which encodes DNA polymerases from archae bacteria. The present invention also relates to 
10 recombinant DNA polymerases from archaebacteria as well as to methods of improving expression of said re- 
combinant DNA polymerases by identifying, locating and removing intervening nucleotide sequences orintrons 
which occur within the DNA coding for said polymerases. 

More specifically, in accordance with the present invention, it has been discovered that DNA coding for 
DNA polymerases from archaebacterium have substantial homology both at the DNA and amino acid level. It 
15 has also been discovered that the DNA from archaebacterium coding for such enzymes appear to have one 
of more intervening nucleotides or introns which also share substantially homology at the DNA level. 

Thus, in accordance with the present invention, DNA probes can be constructed from the DNA sequence 
coding for one DNA polymerase from archaebacterium, such as Thermococcus Htoralis, and used to identify 
and isolate DNA coding for DNA polymerases from other Archaebacterium such as Pyrococcus. Similarly, an- 
20 tibody probes which are cross-reactive with T. Htoralis DNA polymerase can also be used to identify DNA coding 
coding sequences which express such other DNA polymerases. 

Once the DNA coding for the target DNA polymerase has been isolated, it can be used to construct ex- 
pression vectors in order to produce commercial quantitaties of the target DNA polymerase. In this regard, the 
present invention also provides methods of increasing expression levels of the target DNA polymerase by iden- 
25 tifying, locating and removing any intervening nucleotide sequences or introns which occur in the DNA se- 
quence coding for the DNA polymerase. As discussed below, while certain introns are spliced out in E. coli. t 
expression of the recombinant DNA polymerase can be enhanced by removal of such intervening nucleotide 
sequences prior to expression in E. coli. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 A - is a photograph of the SDS-polyacryl amide gel of example 1 . 

FIG. 1 B - is a graph showing the polymerase activity and exonuclease activity of the proteins eluted from 
lane 2 of the gel in Fig. 1A. 

35 FIG. 2 - is a restriction site map of the Xba fragment containing the gene encoding the 7^ Htoralis DNA 
Polymerase which is entirely contained within the BamHI fragment of bacteriophage NEB 619. 
FIG. 3 - Figures 3A and 3B are graphs showing the half-life of native and recombinant T. Htoralis DNA, 
respectively. 

FIG. 4 - is a graph showing the response of T. Htoralis DNA polymerase and Klenow fragment to the pres- 
to ence or absence of deoxynucleotides. 

FIG. 5 - is a restriction site map showing the organization of the T. Htoralis DNA polymerase gene in native 

DNA (BamHI fragment of NEB 61 9) and in E. coli NEB671 and NEB687. 
FIG. 6 - is a partial nucleotide sequence of the 14 kb BamHI restriction fragment of bacteriophage 
NEB619 inclusive of the 1.3 kb t 1.6 kb and 1 .9 kb Eco Rl fragments and part of the Eco RI/BamHI 
45 ' fragment. 

FIG. 7 - is a comparison of the amino acids in the DNA polymerase consensus homology region III with 

the amino acids of the 71 Htoralis homology island III. 
FIGS. 8-10 are representations of the vectors pPR969 and pCAS4 and V1 74-1 B1, respectively. 
FIG. 11 - is a graph illustrating the 7: Htoralis DNA polymerase variant constructed in Example VI lacks de- 
50 tectable 3'to 5' exonuclease activity. 

FIG. 12 - is a nucleotide sequence of the primers used in Example III. 

FIG. 13A- is the ethidium bromide stained agarose gel of Pyrococcus sp. DNA cut with EcoR I (lane 3), 
BamH I (lane 4) and Hind III (lane 5). Lane 1 is XDNA cut with Hind III as markers and lane 2 is 
pBR322 as a marker. 

55 FIG. 13B- is an autoradiography of a Southern hybridization of the same gel in Fig. 13A. The ^P-DNAprobe 
was prepared from a 1 .3 Kb Eco Rl fragment that encodes the amino terminal portion of the T. 
Htoralis DNA polymerase. Note that the BamH I cut Pyrococcus sp. DNA gives a single band of 
about 4-5 Kb with the probe. The fact that the 23 Kd band of Hind III cut XDNA shows up on the 
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film is due to nonspecific hybridization to the large amount of DNA present in that band. The fact 
that the plasmid pBR322 lights up is due to homologous sequences in the probe, 
is a restriction site map of the 4.8 Kb BamH I fragment containing the gene containing the Pyr- 
ococcus sp. DNA polymerase in the pUC19 plasmid of E. coli 2207 (NEB#720). 
illustrates the probes used to analyze the similarity of DNA for other target archaebacteria. 
is an autoradigraph of quadruplicate Southern blots decribed in Example XIV illustrating the hy- 
bridization of probes to T. litoralis and Pyrococcus sp. DNA but not to T. aquaticus DNA. 
is a Western blot of crude lysates from T. litoralis (V), Pyrococcus sp. G-l-J (J), Pyrococcus sp. 
G-l-H (H), or purified polymerases from Pyrococcus sp. GB-D (DV), T. aquaticus (T) or E. coli 
(E) reacted with affinity purified anti-vent DNA polymerase antibody in Part A or anti-Taq DNA 
polymerase antibody in Part B. M represents the marker proteins. The arrow indicates the position 
of the T. litoralis and Pyrococcus sp. DNA polymerase proteins. The reactivity in Part B is to back- 
ground proteins and not to the DNA polymerases as seen in Part A. 

is a partial DNA nucleotide sequence of the gene coding for the Pyrococcus sp. DNA polymerase, 
is a comparison of the deduced amino acid sequence of Pyrococcus sp. DNA polymerase to T. 
litoralis DNA polymerase. 

DETAILED DESCRIPTION OF THE INVENTION 

20 In accordance with one preferred embodiment of the present invention, there is provided a method of pro- 

ducing recombinant DNA polymerase from archaebacterium. The preferred process comprises Informing a 
genomic library from the target archaebacterium, 2) transforming or transfecting an appropriate host cell, 3) 
either i) reacting the DNA from the transformed or transfected host cells with a DNA probe which hybridizes to 
the DNA coding for the DNA polymerase from T. litoralis, or ii) reacting the extract from the transformed or trans- 

25 fected host cells with an antibody probe which is cross-reactive with T. litoralis DNA polymerase, 4) assaying 
the transformed or transfected cells of step 3 which either hybridize to the DNA probe or cross react with the 
T. litoralis specific antibody for thermostable DNA polymerase activity. 

The aforementioned method allows for the production of recombinant DNA polymerases from archaebac- 
terium, as well as for the isolation of DNA coding for said polymerases. 

30 in accordance with another preferred embodiment, there is provided a method for enhancing the expression 
of recombinant DNA polymerases from archaebacterium. As noted above, it is believed that the DNA coding 
for DNA polymerases from archaebacterium may possess one or more introns which may complicate expres- 
sion of the target recombinant DNA polymerase. Location and removal of these introns prior to constructing 
the expression system has been found to enhance expression of the target DNA polymerase, even when the 

35 intron is normally spliced out in its host cell. As discussed in more detail below, the intron can be identified and 
removed in a number of ways. In particular, it has also been found that the introns of T. litoralis share substantial 
homology at the DNA level with other genuses of archaebacteria such as Pyrococcus. Knowledge of this fact 
should facilitate the identification, location and removal of introns by the methods described in more detail be- 
low. 

40 In practicing certain embodiments of the present invention it is preferable to employ either i) DNA probes 

which hybridize to the DNA coding for T. litoralis DNA polymerase, or ii) antibodies which cross-react with T. 
litoralis DNA polymerase. DNA probes are preferably constructed based on the DNA sequence coding for the 
T. litoralis DNA polymerase (See Fig. 6), while the antibody probes are preferably made from the purified T. 
litoralis enzyme itself. Following the procedures of the present invention, one could, of course construct probes 

45 ' based on the DNA polymerase or its DNA from other sources of archaebacterium. However, the preferred DNA 
polymerase and DNA used to construct such probes is from T. litoralis. 

Production of Native 7*. litoralis DNA Polymerase 

so T. litoralis DNA polymerase is obtainable from T. litoralis strain NS-C (DSM No. 5473, a sample of which 

has also been deposited at the American Type Culture Collection on September 17, 1 991 under ATCC Acces- 
sion No. 55233). T. litoralis was isolated from a submarine thermal vent near Naples, Italy in 1985. This organ- 
ism, T. litoralis, is an extremely thermophilic, sulfur metabolizing, archaebacteria, with a growth range between 
55°C and 98°C. Neuner, eta\. t Arch. Microbiol. (1990) 153:205-207. 

55 For recovering the native protein, T. litoralis may be grown using any suitable technique, such as the tech- 

nique described by Belkin, et al., Arch Microbiol. (1985) 142:181-186, the disclosure of which is incorporated 
by reference. Briefly, the cells are grown in the media described above containing 10 mg/ml of sulfur and 0.01 
M cysteine in 15 ml screw cap tubes at 95°C for2 days. When larger amounts of cells are required, 1 liter screw 
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cap bottles are used and after sterilization are inocculated with a fresh 10 ml culture and grown at 90-95?C for 
2 days. 

After cell growth, one preferred method for isolation and purification of the enzyme is accomplished using 
the multi-step process as follows: 
5 First, the cells, if frozen, are thawed, suspended in a suitable buffer such as buffer A (10 mM KP0 4 buffer, 

pH 7.4; 1.0 mM EDTA, 1.0 mM beta-mercaptoethanol), sonicated and centrifuged. The supernatant is then 
passed through a column which has a high affinity for proteins that bind to nucleic acids such as Affigel blue 
column (Biorad). The nucleic acids present in supernatant solution of 7^ litoralis and many of the proteins pass 
through the column and are thereby removed by washing the column with several column volumes of low salt 

10 buffer at pH of about 7.0. After washing, the enzyme is eluted with a linear gradient such as 0.1 to 2.0 M NaCI 
buffer A. The peak DNA polymerase activity is dialyzed and applied to a phosphocellulose column. The column 
is washed and the enzyme activity eluted with a linear gradient such as 0.1 to 1.0 M NaCI in buffer A. The peak 
DNA polymerase activity is dialyzed and applied to a DNA cellulose column. The column is washed and DNA 
polymerase activity is eluted with a linear gradient of 0.1 to 1.0 M NaCI in buffer A. The fractions containing 

15 DNA polymerase activity are pooled, dialyzed against buffer A, and applied to a high performance liquid chro- 
matography column (HPLC) mono-Q column (anion exchanger). The enzyme is again eluted with a linear gra- 
dient such as 0.05 to 1 .0 M NaCI in a buffer A. The fractions having thermostable polymerase activity are pooled, 
diluted and applied to HPLC mono-S column (cation exchanger). The enzyme is again eluted with a linear gra- 
dient such as 0.05 to 1 .0 M NaCI in buffer A. The enzyme is about 50% pure at this stage. The enzyme may 

20 be further purified by precipitation of a contaminating lower molecular weight protein by repeated dialysis 
against buffer A supplemented with 50 mM NaCI. 

The apparent molecular weight of the DNA polymerase obtainable from 7". litoralis is between about 90,000 
to 95,000 daltons when compared with protein standards of known molecular weight, such as phosphorylase 
B assigned a molecular weight of 97,400 daltons. It should be understood, however, that as a protein from an 

25 extreme thermophile, T. litoralis DNA polymerase may electro phorese at an aberrant relative molecular weight 
due to failure to completely denature or other intrinsic properties. The exact molecular weight of the thermo- 
stable enzyme of the present invention may be determined from the coding sequence of the 7: litoralis DNA 
polymerase gene. The molecular weight of the eluted product may be determined by any technique, for exam- 
ple, by SDS-polyacrylamide gel electrophoresis (SDS-PAGE) using protein molecular weight markers. 

30 Polymerase activity is preferably measured by the incorporation of radioactively labeled deoxynucleotides 
into DNAse-treated, or activated, DNA; following subsequent separation of the unincorporated deoxynucleo- 
tides from the DNA substrate, polymerase activity is proportional to the amount of radioactivity in the acid- 
insoluble fraction comprising the DNA. Lehman, I.R., et al., J. Biol. Chem. (1958) 233:163, the disclosure of 
which is incorporated herein by reference. 

35 The half-life of the DNA polymerase of the present invention at 100°C is about 60 minutes. The thermal 
stability or half-life of the DNA polymerase is determined by preincubating the enzyme at the temperature of 
interest in the presence of all assay components (buffer, MgCI 2 , deoxynucleotides, and activated DNA) except 
the single radioactively-labeled deoxynucleotide. At predetermined time intervals, ranging from 4-180 minutes, 
small aliquots are removed, and assayed for polymerase activity using the method described above. 

40 The half-life at 100°C of the DNA polymerase can also be determined in the presence of stabilizers such 

as the nonionic detergent octoxynol, commonly known as TRITON X-100 (Rohm & Haas Co.), or the protein 
bovine serum albumin (BSA). The non-ionic detergents polyoxyethylated (20) sorbitan monolaurate (Tween 20, 
ICI Americas Inc.) and ethoxyiated alkyl Phenol (nonyl) (ICONOL NP-40, BASF Wyandotte Corp.) can also be 
used. Stabilizers are used to prevent the small amount of enzyme added to the reaction mixture from adhering 

45 ' to the sides of the tube or from changing its structural conformation in some manner that decreases its enzy- 
matic activity. The half-life at 100°C of the DNA polymerase obtainable from T. litoralis in the presence of the 
stabilizer TRITON X-100 or BSA is about 95 minutes. 

Preparation Of Recombinant T. litoralis DNA Polymerase 

50 

T. litoralis DNA polymerase may also be produced by recombinant DNA techniques, as the gene encoding 
this enzyme has been cloned from T. litoralis genomic DNA. The complete coding sequence for the T. litoralis 
DNA polymerase (Figure 6) can be derived from bacteriophage NEB #619 on an approximately 14 kb BamHI 
restriction fragment. This phage was deposited with the American Type Culture Collection (ATCC) on April 24, 
55 1 990 and has Accession No. ATCC 40795. 

; The production of a recombinant form of T. litoralis DNA polymerase generally includes the following steps: 
DNA is isolated which encodes the active form of the polymerase, either in its native form or as a fusion with 
other sequences which may or may not be cleaved away from the native form of the polymerase and which 
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may or may not effect polymerase activity. Next, the gene is operably linked to appropriate control sequences 
for expression in either prokaryotic or eukaryotic host/vector systems. The vector preferably encodes all func- 
tions required for transformation and maintenance in a suitable host, and may encode selectable markers 
and/or control sequences for T. Htoralis polymerase expression. Active recombinant thermostable polymerase 

5 can be produced by transformed host cultures either continuously or after induction of expression. Active ther- 
mostable polymerase can be recovered either from within host cells or from the culture media if the protein is 
secreted through the cell membrane. 

While each of the above steps can be accomplished in a number of ways, it has been found that for cloning 
the DNA encoding T, Htoralis DNA polymerase, expression of the polymerase from its own control sequences 

10 in E. coli results in instability of the polymerase gene, high frequency of mutation in the polymerase gene, slow 
cell growth, and some degree of cell mortality. 

While not wishing to be bound by theory, it is believed that this instability is due at least in part to the pres- 
ence of a 1614 bp intron that splits the T. Htoralis DNA polymerase gene from nucleotides 1776 to 3389 of Fig. 
6, and a second 1170 bp intron that splits the T. Htoralis DNA polymerase gene from nucleotides 3534 to 4703. 

15 As discussed below, intervening sequences are also believed to be present in the DNA coding for DNA poly- 
merases from other archaebacteria. Introns from a number of archaebacteria are also believed to share sub- 
stantial homology to the introns present in the DNA for coding for T. Htoralis DNA polymerase, which, in accor- 
dance with one aspect of the present invention, will facilitate their identification, location and removal. 

Introns are stretches of intervening DNA which separate coding regions of a gene (the protein coding re- 

20 gions are called exons). Introns can contain nonsense sequences or can code for proteins. In order to make 
a functional protein, the intron must be spliced out of the pre-mRNA before translation of the mature mRNA 
into protein. Introns were originally identified in eukaryotes, but have been recently described in certain pro- 
karyotes. (See, e.g., Krainer and Maniatis (Transcription and Splicing (1988) B.D. Hames and D.M. Glover, eds. 
IRL Press, Oxford and Washington, D.C. pp. 131-206)). When a gene with an intron is transcribed into mRNA 

25 the intron may self-splice out to form a mature mRNA or cellular factors may be required to remove the intron 
from the pre-mRNA. Id. Bacterial introns often require genus specific co-factors for splicing. For example, a 
Bacillus intron may not be spliced out in E. coli. (Id). 

However, there is some evidence that suggests that the intervening DNA sequence within the gene coding 
for the T. Htoralis DNA polymerase is transcribed and translated, and that the peptide produced therefrom is 

30 spliced out at the protein level, not the mRNA level. Therefore, regardless of where the splicing event occurs, 
in order to express T. Htoralis DNA polymerase in E. co//, it is preferable to delete the intervening sequence 
prior to expression of the polymerase in an E. coli system. Of course, the recombinant vector containing the T. 
Htoralis DNA polymerase gene could be expressed in systems which possess the appropriate factors for splicing 
the intron, for example, a Thermococcus system. 

35 it is also preferable that the T. Htoralis thermostable polymerase expression be tightly controlled in E. coli 

during cloning and expression. Vectors useful in practicing the present invention should provide varying de- 
grees of controlled expression of 7". Htoralis polymerase by providing some or all of the following control fea- 
tures : (1) promoters or sites of initiation of transcription, either directly adjacent to the start of the polymerase 
or as fusion proteins, (2) operators which could be used to turn gene expression on or off, (3) ribosome binding 

40 sites for improved translation, and (4) transcription or translation termination sites for improved stability. Ap- 
propriate vectors used in cloning and expression of 7: Htoralis polymerase include, for example, phage and plas- 
mids. Example of phage include Xgtit (Promega), X Dash (Stratagene) X Zapll (Stratagene). Examples of plas- 
mids include pBR322, pBluescript (Stratagene), pSP73 (Promega), pGW7 (ATCC No. 40166), pET3A (Rosen- 
berg, etal., Gene, (1987) 56:125-135), and pET11C (Methods in Enzymotogy (1990) 185:60-89). 

45 ' ■ 

Transformation and Infection 

Standard protocols exist for transformation, phage infection and cell culture. Maniatis, et al., Molecular 
Cloning : A Laboratory Manual (1982). Of the numerous E. coli strains which can be used for plasmid trans- 

50 formation, the preferred strains include JM101 (ATCC No. 33876), XL1 (Stratagene), and RRI (ATCC No. 
31343), and BL21(DE3) plysS (Method in Enzomotogy (1990) supra). E. coli strain XL1, ER1578 and ER1458 
(Raleigh, etal., N.A. Research (1988) 16:1563-1 575) are among the strains that can be used for lambda phage, 
and Y1089 can be used for lambda gtll lysogeny. When preparing transient lysogens in Y1089 (Arasu, et al., 
Experimental Parasitology (1987) 64:281-289), a culture is infected with lambda gtll recombinant phage either 

55 by a single large dose of phage or by co-culturing with a lytic host The infected Y1 089 cells are preferably grown 
at 37°C in the presence of the inducer IPTG resulting in buildup of recombinant protein within the lysis-defective 
host/phage system. 
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Construction of Genomic DNA Expression Library and Screening for Thermostable Polymerase 

The most common methods of screening for a gene of choice are (1) by hybridization to homologous genes 
from other organisms, (2) selection of activity by complementation of a host defect, (3) reactivity with specific 

5 antibodies, or (4) screening for enzyme activity. For T. Utoralis, antibody detection is preferred since it initially 
only requires expression of a portion of the enzyme, not the complete "active enzyme. The instability of the 7! 
Utoralis polymerase gene in E. coli would have made success by other methods more difficult. 

T. Utoralis DNA can be used to construct genomic libraries as either random fragments or restriction enzyme 
fragments. The latter approach is preferred. Preferably, Eco Rl partials are prepared from T. Utoralis genomic 

10 DNA using standard DNA restriction techniques such as described in Maniatis, etal., Molecular Cloning : A lab- 
oratory Manual (1 982), the disclosure of which is incorporated herein by reference. Other restriction enzymes 
such as BamHI, Nrul and Xbal can also be used. 

Although methods are available to screen both plasmids and phage using antibodies (Young and Davis, 
PNAS, (1983) 80:1194-1198), in accordance with the present invention it has been found that phage systems 

15 tend to work better and are therefore preferred for the first libraries. Since it is uncertain whether T. Utoralis con- 
trol regions function in E. coli, phage vectors which supply all necessary expression control regions such as 
lambda gt11 and lambda Zap II, are preferred. By cloning T. Utoralis DNA into the Eco Rl site of lambda gtll, T. 
Utoralis polymerase may be expressed either as a fusion protein with beat-galactosidase or from its own en- 
dogenous promoter. 

20 Once formed, the expression libraries are screened with mouse anti- T. Utoralis DNA polymerase antiserum 

using standard antibody/plaque procedures such as those described by Young and Davis, PAMS(1983), supra. 

The mouse anti- £ Utoralis DNA polymerase antiserum used to screen the expression libraries can be pre- 
pared using standard techniques, such as the techniques described in Harlow and Cane, Antibodies: A Labo- 
ratory Manual (1988) CSH Press, the disclosure of which is incorporated herein by reference. Since most sera 

25 react with E. coli proteins, it is preferable that the T. Utoralis polymerase antisera be preabsorbed by standard 
methods against E. coli proteins to reduce background reactivity when screening expression libraries. Phage 
reacting with anti- T. Utoralis polymerase antiserum are picked and plaque purified. Young and Davis, PNAS 
(1983), supra. 

The T. Utoralis DNA polymerase DNA, coding for part or the whole gene, can then be subcloned in, for ex- 
30 ample, pBR322, pBIuescript, m13 or pUC19. If desired, the DNA sequence can be determined by, for example, 
the Sanger dideoxy chain-terminating method (Sanger, R, Nicklen, S. & Coulson, A.R. PNAS (1977) 74:5463- 
5467). 

Identification of DNA Encoding and Expression of the T. Utoralis DNA Polymerase 
35 — — 

Several methods exist for determining that the DNA sequence coding for the £ Utoralis DNA polymerase 
has been obtained. These include, for example, comparing the actual or deduced amino-terminal sequence of 
the protein produced by the recombinant DNA to the native protein, or determining whether the recombinant 
DNA produces a protein which binds antibody specific for native T. Utoralis DNA polymerase. In addition, re- 

40 search by Wang, et al., FASEB Journal (1 989) 3:20 suggests that certain regions of DNA polymerase sequenc- 
es are highly conserved among many species. As a result, by comparing the predicted amino acid sequence 
of the cloned gene product with the amino acid sequence of known DNA polymerases, such as human DNA 
polymerase and E. coli phage T4 DNA polymerase, the identification of these islands of homology provides 
strong evidence that the recombinant DNA indeed encodes a DNA polymerase. Once identified, the DNA se- 

45 ' que nee coding for the 7^ Utoralis DNA polymerase, can be cloned into an appropriate expression vector such 
as a plasmid derived from E. coli, for example, pET3A, pBIuescript or pUC19, the plasmids derived from the 
Bacillus subtilis such as pUB110, pTP5 and pC194, plasmids derived from yeast such as pSH19 and pSH15, 
bacteriophage such as lambda phage, bacteria such as Agrobacterium tumefaciens, animal viruses such as 
retroviruses and insect viruses such as Baculovirus. 

50 As noted above, in accordance with the present invention, it has been found that DNA coding for T. Utoralis 

DNA polymerase contains two introns: i) an 1614 bp intron or intervening sequence, spanning from nucleotides 
1776 to 3389 in Figure No. 6, and ii) an 1170 bp intron, spanning nucleotides 3534 to 4703 in Figure 6. This 
1170 bp intron codes for an endonuclease and is found to self-splice out in E. coli. Prior to overexpression in 
host cells such as E. coli, it is preferable to delete the DNA sequence coding for both the 1614 and 1170 bp 

55 introns. Even though the 1170 bp intron splices out in E. coli., it has been found that expression vectors which 
do not contain this intron result in increased production of the desired polymerase. 

In general, once an intron has been identified and located within a nucleotide sequence, there are a number 
of approaches known in the art which can be used to delete DNA sequences and therefore splice out an intron 
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in-vitro. One method involves identifying unique restriction enzyme sites in the coding region which are near 
the splice junction or area to be deleted. A duplex oligomer is synthesized to bridge the gap between the two 
restriction fragments. A three-part ligation consisting of the amino end restriction fragment, the bridging oligo 
and the carboxy end restriction fragment yields an intact gene with the intron deleted. 

5 Another method is a modification of the above-described method. The majority of the intron is deleted by 

cutting with restriction enzymes with unique sites within the intron, but close to the coding sequence border. 
The linear plasmid containing a deletion of the majority of the intron is ligated together. Single strand phage 
are generated from the pBluescript vector recombinant by super infection with thefl helper phage I R1. A singe! 
strand oligomer is synthesized with the desired final sequence and is annealed to the partially deleted intron 

10 phage DNA. The remainder of the intron is thus looped out. By producing the original phage in E coll strain 
CJ236 the Kunkel method of mutagenesis Methods in Enymology 154:367 (1987)) can be used to select for 
the full deleted intron constructs. 

Yet another method which can be used to delete the intron uses DNA amplification. See, for example, Man- 
iatis, et al., Molecular Cloning: A Laboratory Manual, (1989) Vol. 2, 2nd edition, the disclosure of which is herein 

15 incorporated by reference. Briefly, primers are generated to amplify and subsequently join the amino and car- 
boxy! halves of the gene. 

When an intron is deleted in-vitro, using the methods discussed above, the native splice junction may be 
unknown. Accordingly, one skilled in the art would predict that several possible artificial splice junctions exist 
that would result in the production of an active enzyme. 

20 Once the intron is deleted, overexpression of the 7". litoralis DNA polymerase can be achieved, for example, 

by separating the T. litoralis DNA polymerase gene from its endogenous control elements and then operably 
linking the polymerase gene to a very tightly controlled promoter such as a T7 expression vector. See, Rosen- 
berg, et al., Gene (1987) 56:125-135, which is hereby incorporated by reference. Insertion of the strong pro- 
moter may be accomplished by identifying convenient restriction targets near both ends of the T. litoralis DNA 

25 polymerase gene and compatible restriction targets on the vector near the promoter, or generating restriction 
targets using site directed mutagenesis (Kunkel (1984) supra), and transferring the T. litoralis DNA polymerase 
gene into the vector in such an orientation as to be under transcriptional and translational control of the strong 
promoter. 

T. litoralis DNA polymerase may also be overexpressed by utilizing a strong ribosome binding site placed 
30 upstream of the T. litoralis DNA polymerase gene to increase expression of the gene. See, Shine and Dalgamo, 
Proc. Natl. Acad. Sci. USA (1974) 71:1342-1346, which is hereby incorporated by reference. 

The recombinant vector is introduced into the appropriate host using standard techniques for transforma- 
tion and phage infection. For example, the calcium chloride method, as described by Cohen, S.N., PNAS (1972) 
69:2110 is used for E. coli, the disclosure of which is incorporated by reference. The transformation of Bacillus 
35 is earned out according to the method of Chang, S., et al„ Molecular and General Genetics (1979) 168:111, 
the disclosure of which is incorporated by reference. Transformation of yeast is carried out according to the 
method of Parent, etal., Yeasr (1985) 1:83-138, the disclosure of which is incorporated by reference. Certain 
plant ceils can be transformed with Agrobacterium tumefaciens, according to the method described by Shaw, 
C.H., et al.. Gene (1983) 23:315, the disclosure of which is incorporated by reference. Transformation of animal 
40 cells is carried out according to, for example, the method described in Virology (1973) 52:456, the disclosure 
of which is incorporated by reference. Transformation of insect cells with Baculovirus is carried out according 
to, for example, the method described in Biotechnology (1988) 6:47, the disclosure of which is incorporated 
herein by reference. 

The transformants are cultivated, depending on the host cell used, using standard techniques appropriate 
45 ' to such cells. For example, for cultivating E. coli, cells are grown in LB media (Maniatis, supra) at 30°C to 42°C 
to mid log or stationary phase. 

The T. litoralis DNA polymerase can be isolated and purified from a culture of transformed host cells, for 
example, by.either extraction from cultured cells or the culture solution. 

When the T. litoralis DNA polymerase is to be extracted from a cultured cell, the cells are collected after 
50 cultivation by methods known in the art, for example, centrifugation. Then, the collected cells are suspended 
in an appropriate buffer solution and disrupted by ultrasonic treatment lysozyme and/or freeze- thawing. A crude 
extract containing the T. litoralis DNA polymerase is obtained by centrifugation and/or filtration. 

When the T. litoralis DNA polymerase is secreted into the culture solution, i.e., alone or as a fusion protein 
with a secreted protein such as maltose binding protein, the supernatant is separated from the celis by methods 
55 known in the art 

The separation and purification of the T. litoralis DNA polymerase contained in the culture supernatant or 
the cell extract can be performed by the method described above, or by appropriate combinations of known 
separating and purifying methods. These methods include, for example, methods utilizing solubility such as 
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salt precipitation and solvent precipitation, methods utilizing the difference in molecular weight such as dialysis, 
ultra-filtration, gel-filtration, and SDS-polyacrylamide gel electrophoresis, methods utilizing a difference in elec- 
tric charge such as ion-exchange column chromatography, methods utilizing specific affinity such as affinity 
chromatography, methods utilizing a difference in hydrophobicity such as reverse-phase high performance liq- 
5 uid chromatography and methods utilizing a difference in isoelectric point such as isoelectric focusing electro- 
phoresis. 

One preferred method for isolating and purification of the recombinant enzyme is accomplished using the 
multi-stage process as follows. 

First, the cells, if frozen are thawed, suspended in a suitable buffer such as Buffer A (100 mM NaCI, 25 
10 mM Tris pH 7.5, 0.1 mM EDTA, 10% glycerol, 0.05% Triton X-100), lysed and centrifuged. The clarified crude 
extract is then heated to 75°C for approximately 30 minutes. The denatured proteins are removed by centrifu- 
gation. The supernatant is then passed through a column that has high affinity for proteins that bind to nucleic 
acids such as Affigel Blue column (Biorad). The nucleic acids present in the supernatant solution and many of 
proteins pass through the column and are thereby removed by washing the column with several column vol- 
ts umes with low-salt buffer at pH of about 7.0. After washing, the enzyme is eluted with a linear gradient such 
as 0.1 M to 1.5 M NaCI Buffer A. The active fractions are pooled, dialyzed and applied to a phosphocellulose 
column. The column is washed and DNA polymerase activity eluted with a linear gradient of 0.1 to 1.0 M NaCI 
in Buffer B (100 M NaCI, 15 mM KP0 4 , 0.1 mM EDTA, 10% glycerol, 0.05% Triton X-100, pH 6.8). The fractions 
are collected and BSA is added to each fraction. The fractions with DNA polymerase activity are pooled. The 
20 T. Htoralis DNA polymerase obtained may be further purified using the standard product purification techniques 
discussed above. 

Stabilization and Use of the T. liioralis DNA Polymerase ' 

25 For long-term storage, the thermostable enzyme of the present invention is stored in the following buffer 

0.05 M NaCI, 0.01 M KP0 4 (pH 7.4), 0.1 mM EDTA and 50% glycerol at -20°C. 

The T. Htoralis DNA polymerase of the present invention may be used for any purpose in which such an 
enzyme is necessary or desirable. For example, in recombinant DNA technology including, second-strand 
cDNA synthesis in cDNA cloning, and DNA sequencing. See Maniatis, et al., supra. 

30 The T. liioralis DNA polymerase of the present invention may be modified chemically or genetically to in- 

activate the 3*-5' exonuclease function and used for any purpose in which such a modified enzyme is desirable, 
e.g., DNA sequencing. 

For example, genetically modified T. Htoralis DNA polymerase may be isolated by randomly mutagen izing 
the T. Htoralis DNA polymerase gene and then screening for those mutants that have lost exonuclease activity, 
35 without loss of polymerase activity. Alternatively, genetically modified T. Htoralis DNA polymerase is preferably 
isolated using the site-directed mutagenesis technique described in Kunke!, T.A., PNAS (1985) 82:488-492, 
the disclosure of which is herein incorporated by reference. 

In addition, the T. Htoralis DNA polymerase of the present invention may also be used to amplify DNA, e.g., 
by the procedure disclosed in U.S. Patent Nos. 4,683,195, 4,683,202 and 4,800,159. 

40 

Construction of Genomic DNA Library and Screening for Thermostable Polymerase from Archaebacteria 
other than T. Htoralis 

In accordance with the present invention, cross hybridization of a target Archaebacterium genomic DNA 
45 ' library using an DNA probe prepared from the DNA polymerase gene of T. Htoralis and/or cross-reactivity with 
mouse anti-I Htoralis antiserum allows for the identification and isolation of the DNA polymerase genes from 
other archaebacterium, such as Methanococcus, Methanobacter, Methanomicrobium, Halobacter, Thermo- 
plasma, Thermococcus, Pyrococcus, and the like (see, e.g. Woese, C, Microbiological Reviews, pp. 221-270, 
June 1 987, the disclosure of which is hereby incorporated by reference). 
50 in general, DNA from other archaebacterium can be isolated using the method described above. As with 

T. Htoralis The archaebacterium DNA once isolated can be used to construct genomic libraries as either random 
fragments or restriction enzyme fragments. The latter approach is preferred. This approach generally entails 
cutting the target genomic DNA with various restriction enzymes and probing the fragments so formed with, for 
example, a T. Htoralis DNA probe. A library is thereafter formed from one or more of the enzymes which produce 
55 a single hybridization band and which are about 4Kb or large enough to at least code for the molecular weight 
of the target DNA polymerase. 

Although methods are available to screen both plasmids and phage using antibodies or DNA probes (Young 
and Davis, PNAS (1983) 80:1194-1198; Maniatis et al, supra) in accordance with the present invention it has 
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been found the phage systems tend to work better and are therefore preferred for the first libraries. 

Genomic libraries can be screened using the colony or plaque hybridization procedure (Maniatis, et al. su- 
pra) or using the antibody plaque DNA procedures. In the colony or plaque hybridization procedure, DNA probes 
may be formed by labelling a polymerase gene from a related organism, for example, T. litoralis. The genomic 
5 library is hybridized with labeled probe under conditions which depend on the stringency desired, which may 
be experimentally determined in each case as described below. > 

Specifically, although each archaebacterium will require its own set of hybridization conditions, in order to 
maximize the detectability of the target DNA, several basic approaches can be followed. Optimum hybridization 
conditions and probes can be determined for each target archaebacterium, for example, by performing test 
10 Southern blots at various temperatures. Hybridization is typically carried out in 4X SET, 0.1 M sodium phos- 
phate, pH 7.0, 0.1% Na pyrophosphate, 0.1% SDS, 1X Denhardts solution (Maniatis, supra). Probe selection, 
can also vary with respect to size and regions of the T. litoralis DNA polymerase gene (Fig. 6). Optimum probes 
can be determined for a target archaebacterium by performing test Southern blots as described above with large 
or small DNA fragments, or even oligomers. One could, for example, select probes that are totally within one 
15 of the intervening sequences of 7*. litoralis to screen for intervening sequences in the target archaebacterium's 
DNA polymerase gene, or such probes could be limited to mature polymerase coding regions. 

In general, the DNA probe could be the entire sequence of Figure 6, or a portion thereof. The DNA probe 
should be at least 20 nucleotides in length, preferably at least about 50 nucleotides in length, most preferably 
at least about 150 nucleotides in length. Three such DNA probes which may be used are the 1.3 kb fragment 
20 (nucleotides 1 to 1274 of Figure 6), the 1.6 kb fragment (nucleotides 1269 to 2856 of Figure 6), and the 1.9 kb 
fragment (nucleotides 2851 to 4771 of Figure 6). 

As with 7; litoralis, the DNA coding for the target archaebacterium DNA polymerase may also be obtained 
using an antibody/plaque procedure. When genomic expression libraries are screened using the antibody/pla- 
que procedure, since it is uncertain whether the target archaebacterium's control regions will function in E. co//, 
25 phage vectors which supply ail necessary expression control regions such as Xgt11 and XZap II are preferred 
for antibody screening. By cloning archaebacterium DNA into an appropriate site such as the EcoR I site of 
Xgt11, the archaebacterium's DNA polymerase may be expressed either as a fusion protein with beta- 
galactosidase in Xgt11 and XZaplI or from its own endogenous promoter. 

Once formed, the expression libraries can be screened either with anti-archaebacterium DNA polymerase 
30 antiserum from the target archaebacterium or, by antibody against the DNA polymerase of a closely related 
organism (i.e. T. litoralis, another extreme thermophile) using standard antibody/plaque procedures such as 
those described by Young and David, PNAS (1983), supra. 

Using either procedure, the archaebacterium DNA polymerase DNA, coding for part or the whole gene, 
once identified can then be subcloned in, for example, pBR322, pBluescript, M13 or pUC19. If desired, the DNA 
35 sequence can be determined by, for example, the Sanger dideoxy chain-terminating method (Sanger, F., Nick- 
len, S. & Coulson, A.R. PNAS (1977) 74:5463-5467). 

Identification of the DNA Encoding the DNA Polymerase 

40 Once the genomic DNA expression library has been constructed and the target DNA coding for the arch- 

aebacterium DNA has been identified by use of DNA probes or antibody cross-reactivity from T. litoralis, one 
may confirm that a DNA polymerase sequence has been obtained as described above for T. litoralis. The re- 
sulting clone may be sequenced by standard methods such as by Sanger dedioxy sequencing. 

45 ' Identification, Location and Removal of Intervening Sequencing and Overexpression of the DNA Polymerase 

In accordance with another aspect of the present invention, it has been found that the DNA coding for DNA 
polymerases from other archaebacterium also contain one or more intervening nucleotide sequences or introns. 
Moreover, it has been found that not only do such introns share substantial homology with the introns found in 

so T. litoralis, they appear to be located in the same positions. More specifically, in accordance with the present 
invention, introns have been identified in the Pol a conserved region motifs in both T. litoralis and Pyrococcus 
sp. DNA polymerase genes. Without wishing to be bound by theory, it is believed that other archaebacteria also 
possess one or more intervening sequences in the coding region for their DNA polymerases! These introns 
can be identified in two ways. If the intron(s) is related to the intron(s) located in T. litoralis and/or Pyrococcus 

55 sp. DNA polymerases genes, they can be identified by low stringency hybridization to DNA probes derived from 
the intron sequences of T. litoralis or Pyrococcus sp. DNA polymerase genes. Secondly, once the archaebac- 
terium DNA polymerase gene has been identified and isolated as described above, its DNA polymerase gene 
can be sequenced at the DNA level and the sequence compared to (1) other DNA polymerases to identify non- 
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similar segments, or (2) conserved motifs to look for the absence of one or more of Regions l-VI, followed by 
identification of interruption points in the Region(s) which are absent 

Once identified, the intron(s) can be removed in vitro by, for example, the techniques described above and 
in the Examples for removal of the two introns in the T Htoraiis DNA polymerase gene. 
5 The following examples are given to illustrate embodiments of the present invention as it is presently pre- 

ferred to practice. It will be understood that the examples are illustrative, and that the invention is not to be 
considered as restricted except as indicated in the appended claims. 

EXAMPLE I 

10 

PURIFICATION OF A THERMOSTABLE DNA POLYMERASE FROM THERMOCOCCUS LITORALIS 

T. litoralis strain NS-C (DSM No. 5473) was grown in the media described by Belkin, et al. supra, containing 
1 0 g/I of elemental sulfur in a 1 00 liter fermentor at its maximal sustainable temperature of approximately 80°C 

15 for two days. The cells were cooled to room temperature, separated from unused sulfur by decanting and col- 
lected by centrifugation and stored at -70°C. The yield of cells was 0.8 g per liter. 

183 g of cells obtained as described above, were suspended in 550 ml buffer A (10 mM KP0 4 buffer, pH 
7.4; 1.0 mM EDTA, 1.0 mM beta-mercaptoethanol) containing 0.1 M NaCI and sonicated for 5 minutes at 4°C. 
The lysate was centrifuged at 15,000 g for 30 minutes at 4°C. The supernatant solution was passed through 

20 a 470 ml Affigel blue column (Biorad). The column was then washed with 1000 ml of buffer A containing 0.1 M 
NaCI. The column was eluted with a 2000 ml linear gradient from 0,1 to 2.0 M NaCI in buffer A. The DNA poly- 
merase eluted as a single peak at approximately 1.3 M NaCI and represented 80% of the activity applied. The 
peak activity of DNA polymerase (435 ml) was dialyzed against 4 liters of buffer A, and then applied to 80 ml 
Phosphocellulose column, equilibrated with buffer A containing 0.1 M NaCI. The column was washed with 1 60 

25 ml of buffer A containing 0.1 M NaCI, and the enzyme activity was eluted with 1000 ml linear gradient of 0.1 to 
1.0 M NaCI in buffer A. The activity eluted as a single peak at 0.6 M NaCI and represented 74% of the activity 
applied. The pooled activity (150 ml) was dialyzed against 900 ml of buffer A and applied to a 42 ml DNA-cel- 
lulose column. The column was washed with 84 ml of buffer A containing 0.1 M NaCI, and the enzyme activity 
eluted with a linear gradient of buffer Afrom 0.1 to 1 .0 M NaCI. The DNA polymerase activity eluted as a single 

30 peak at 0.3 M NaCI, and represented 80% of the activity applied. The activity was pooled (93 ml). The pooled 
fractions were dialyzed against 2 liters of buffer A containing 0.05 M NaCI and then applied to a 1.0 ml HPLC 
mono-Q column (Pharmacia). The DNA polymerase activity was eluted with a 100 ml linear gradient of 0.05 M 
to 1.0 M NaCI in buffer A. The DNA polymerase activity eluted as a single peak at 0.1 M NaCI and represented 
16% of the activity applied. The pooled fractions (3.0 ml) were diluted to 6 ml with buffer A and applied to an 

35 1.0 mi HPLC mono-S column (Pharmacia) and eluted with a 100 ml linear gradient in buffer Afrom 0.05 to 1,0 
M NaCI. The activity eluted as a single peak at 0.19 M NaCI and represented 75% of the activity applied. 

By SDS-polyacrylamide gel electrophoresis (SDS-PAGE) and subsequent staining of the proteins using a 
colloidal stain (ISS Problue) more sensitive than Coomassie Blue (Neuhoff, et al., Electrophoresis (1988) 9:255- 
262), it was determined that the DNA polymerase preparation was approximately 50% pure: two major bands 

40 were present, one at 90,000 to 95,000 daltons and a doublet at 18,000 daltons. Figure No. 1A. Avery minor 
band was evident at approximately 80,000 to 85,000 daltons. At this level of purification the polymerase had a 
specific activity of between 30,000 and 50,000 units of polymerase activity per mg of polymerase protein. On 
a separate SDS-polyacrylamide gel verification of the identity of the stained band at 90,000 to 95,000 daltons 
was obtained by cutting the gel lane containing the' purified T. litoralis polymerase into 18 slices. Embedded 

45 ' proteins were eluted from the gel by crushing the gel slices in a buffer containing 0.1% SDS and 100u.g/ml BSA. 
The eluted proteins were denatured by exposure toguanidine HCI, then renatured via dilution of the denaturant 
as described by Hager and Burgess Analytical Biochemistry (1980) 109:76-86. Polymerase activity as meas- 
ured by incorporation of radioactivity labeled 32 P-dCTP into acid-insoluble DNA (as previously described) and 
assayed for exonuclease activity (as measured by the release of 3 H-labeIIed DNA to an acid soluble form as 

so described in Example V). As shown in Figure No. 1B, only the 90,000 to 95,000 daltons band alone showed 
either significant polymerase activity or exonuclease activity. 

The DNA polymerase preparation was dialyzed against buffer A containing 0.05 M NaCI. As was determined 
by SDS-PAGE, much of the 1 8,000 dalton protein precipitated out of the solution. The yield of 7: litoralis DNA 
polymerase was determined to be 0.5 mg by quantitative protein analysis, and this represented 6.5% of the 
55 total activity present in the starting crude extract. 

Purified T. litoralis polymerase was electro phoresed and stained with either Coomassie Blue or the colloidal 
stain (ISS Problue) previously described to detect protein. One deeply staining protein band was seen at about 
90,000 to 95,000 daltons; this molecular weight determination was obtained by comparison on the same gel 
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to the migration of the following marker proteins (Bethesda Research Laboratories): myosin, 200,000 daltons; 
phosphoryiase B, 97,400 daltons; BSA, 68,000 daltons; ovalbumin, 43,000 daltons, carbonic anhydrase 29,000 
daltons; b-lactoglobulin, 18,400 daltons; lysoyzme 14,300 daltons. 

5 EXAMPLE II 

CLONING OF T. LITORAUS DNA POLYMERASE GENE . 

A. PRODUCTION OF MOUSE ANTI-I LtTORAUS DNA POLYMERASE ANTISERA 

10 

Immunization of Mice 

A 3 ml solution containing 0.4 mg of polymerase protein (obtained by the method of Example I) was con- 
centrated at 4°C to approximately 0.3 ml and used to inoculate two mice. The purified 7: litoralis polymerase 
15 preparation consisted of four bands of approximately 85-95, 75-85, and a doublet of 1 0-25 kDal on Coomassie 
blue stained SDS-PAGE gels. As shown in Example I, the T. litoralis polymerase is approximately 90-95 kDal. 
Both T. litoralis polymerase antisera recognize all four proteins present in the immunogen. 

The immunization schedule was as follows: mouse one was immunized intra peritioneally (IP) with 20 \xg 
of T. litoralis polymerase, prepared as above, in Freunds' complete adjuvant (FCA). Seven days later, both mice 
20 were immunized IP with 50 jag T. litoralis polymerase in FCA. Twenty-seven days later both mice were immun- 
ized IP with 30 ng T. litoralis polymerase for mouse one and 50 u.g T. litoralis polymerase for mouse two in 
Freunds' incomplete adjuvant. Mouse one was bled two weeks later and mouse two was bled 20 days later. 
Sera was prepared from blood by standard methods (Harlow and Lane, Antibodies: A Laboratory Manual, 
1988). 

25 Anti-7! litoralis polymerase antisera was diluted in TBSTT (20 mM Tris pH 7,5, 150 mM NaCI, 0.2% Tween 

20, and 0.05% Triton-X 100) containing 1% BSA, 0.1% NaAzide, 0.1% PMSF. 

Preabsorption of Anti-I litoralis Polymerase Antiserum Against E. coli Lysates 

30 Since most sera react with E. coli proteins, T. litoralis polymerase antisera were preabsorbed, using the 
following method, against E. coli proteins to reduce background reactivity when screening libraries or recom- 
binant antigens. E. coli cell paste was thawed and lysed by sonication and soluble protein was bound to Affigel 
10 (Biorad) as described by the manufacturer. 4 ml of E. coli resin were washed two times in TBS (TBSTT with- 
out detergents). 0.35 ml of sera was diluted approximately 1 to 5 in TBSTT, 1 % BSA, 0.1% NaAzide and mixed 

35 with resin overnight at 4°C. The resin was pelleted by centrifugation and washed. The recovered preabsorbed . 
sera was at a 1 to 17 dilution and was stored frozen at -20°C until use. 

For screening, preabsorbed sera was diluted as above to a final concentration of 1:200. 

B. IDENTIFICATION OF A PROBE FOR THE T. LITORAUS POLYMERASE GENE 

40 . " . . 

Construction of a Lambda gtll Expression Library 

A probe for the T. litoralis polymerase gene was obtained following immunological screening of a lambda 
gtll expression library. 

45 ' T. litoralis DNA was partially digested as follows: four ng of T. litoralis DNA was digested at 37°C with five 
units of Eco Rl in a 40 uJ reaction using Eco Rl buffer (Eco Rl buffer = 50 mM NaCI, 100 mM Tris pH 7.5, 20 
mM MgCI 2 , 10 mM BME). Three uJ of 100 mM EDTA was added to 15 nl samples at 30, 45 and 60 minutes. 2 
l*g of T. litoralis DNA was digested for 90 minutes at 37°C with 20 units of Eco Rl in 20 nl reaction using Eco 
Rl buffer and the reaction was stopped by adding 2 ui of 1 00 mM EDTA. 0.2 \xg of each digest was electro- 

50 phoresed on an agarose gel to monitor the extent of digestion. Approximately 3 ng of T. litoralis DNA Eco Rl 
partials (14 \i\ from the 60-minute digest and 19 ul from the 90-minute digest) were pooled to form the "Eco Rl 
poor and heated at 65°C for 1 5 minutes. 

0.5 ul of the Eco Rl pool were ligated to 0.28 ug of Eco Rl cut, bacterial alkaline phosphatase treated lambda 
gtll DNA in a five ul reaction using standard ligation buffer (ligation buffer = 66 mM Tris pH 7.5, 1 mM ATP, 1 

55 mM spermidine, 10 mM MgCI2, 15 mM DTT, and 2 mg/ml gelatin) and 0.5 ul T4 DNAIigase (New England Bio- 
labs No. 202). The ligation was performed at 16°C overnight. 4 ul of this ligation reaction were packaged using 
Gigapack Gold (Stratagene) according to the manufacturers instructions. After incubation at room temperature 
for two hours, the packaged phage were diluted in 500 ul of SM (SM = 100 mM NaCI, 8 mM MgS0 4 , 50 mM 
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Tris pH 7.5, 0.01% gelatin) plus three drops chloroform. The packaged Eco Rl library was called sample, V6-1 
and consisted of 1.1 x 10 s individual phage. B. coli strain ER1578 was used for phage infection. 

Immunological Screening of Lambda gtll Expression Library 

5 

The initial phage library was screened (Young, R.A. and R.W. Davis Science, (1 983) 222:778-782) with a 
1 :200 dilution of the antiserum produced above. 36 phage (V10-22 through V10-55) which reacted with the anti- 
T. litoralis DNA polymerase antiserum were picked and 16 phage were plaque purified. 

The 1 6 antibody positive phage were used to lysogenize E. co//K-1 2 strain Y1 089. Lysogens were screened 
10 for thermostable DNA polymerase activity, no activity was detected. 

Western blots (Towbin, et al. t PNAS, (1979) 76:4350-4354) from these 16 lysates were probed with anti- 
T. litoralis polymerase antiserum. All proteins from these lysates which reacted with T. litoralis polymerase an- 
tiserum were smaller than T. litoralis polymerase, and were also smaller than beta-galactosidase, indicating 
that none were fusion proteins with beta-galactosidase. 
15 Eight of the 16 antibody positive phage were used to affinity purify epitope-specific antibodies from total 

antiserum (Beall and Mitchell, J. Immunological Methods , (1986) 86:217-223). 

The eight affinity purified sera were used to probe Western blots of both purified T. litoralis polymerase and 
T. litoralis crude lysates. Antibody purified from NEB 618 plaques specifically reacted with T. litoralis polymerase 
in purified and Z litoralis crude lysates. This was strong evidence that phage NEB 618 encodes approximately 
20 38 kDal of the amino terminus of the 7^ litoralis polymerase. 

Characterization of Phage NEB 618 and Subcloning of Eco Rl Inserts 

Western blot analysis indicated that phage NEB 618 synthesized several peptides ranging in size from approx- 

25 imately 15-40 kDal which bound 7: litoralis polymerase antisera. DNA from phage NEB 618 was purified from 
liquid culture by standard procedures (Maniatis, et al., supra.) Digestion of NEB 618 DNA with Eco Rl yielded 
fragments of 1.3 and 1.7 kb. An Eco Rl digest of NEB 618 DNA was ligated to Eco Rl cut pBluescript DNA. 20 
ug of pBluescriptSK+ were digested with 40 units of Eco Rl in 40ul Eco Rl buffer at 37°C for three hours, followed 
by 65° for 15 minutes. 10 ^g of NEB 618 DNA were digested with 40 units of Eco Rl in 40 ul Eco Rl buffer at 

30 37°C for 75 minutes, followed by 65°C for 15 minutes. 1.75 ug of Eco Ri cut NEB 618 DNA were ligated to 20 
ng Eco Rl cut pBluescriptSK+ with one uJ T4 DNA ligase (New England Biolabs No. 202) in 1 0 ul ligation buffer. 
The ligation was performed overnight at 16°C. JM101 CaCl competent cells (Maniatis, et al„ supra) were trans- 
formed with 5 uJ of the ligation mixture. Of 24 recombinants examined, all but one contained the 1.7 kb fragment; 
clone V27-5.4 contained the 1.3 kb T. litoralis DNA fragment. 

35 Antibodies from T. litoralis polymerase mouse antisera were affinity purified, as described above, on lysates 

from V27-5.4 (encoding the 1 .3 kb Eco Rl fragment) and V27-5.7 (encoding the 1 .7 kb Eco Rl fragment in pBlue- 
script) and reacted with Western blot strips containing either purified or crude T. litoralis polymerase. Antibodies 
selected on lysates of V27-5.4 reacted with T. litoralis polymerase in both crude and purified preparations. In 
addition, the first three amino acids from the N-terminal protein sequence of native 7^ litoralis polymerase (me- 

40 thionine-isoleucine-leucine) are the same as in the predicted open reading frame (ORF) in the V27-5.4 clone. 
From these results it was concluded that V27-5.4 encoded the amino terminal of 7: litoralis polymerase. 
The 1 .3 kb Eco Rl fragment of V27-5.4 comprises nucleotides 1 to 1274 of Figure No. 6, The insert DNA was 
large enough to encode the biggest peptides synthesized by this clone, but not the entire T. litoralis polymerase. 

45 ' C. CONSTRUCTION AND SCREENING OF T. LITORALIS SECONDARY LIBRARIES 

Antibody screening discussed above, had identified the DNA fragment coding the amino terminal half of 
the T. litoralis polymerase. In order to find a fragment large enough to code for the entire gene, restriction digests 
of 7; litoralis DNA were probed with the amino terminal half of the polymerase gene contained in clone V27- 

so 5.4. Restriction digests were performed in separate tubes using a master mix which contained 1.2 |ig of T. li- 
toralis DNA in 39 ul of restriction enzyme buffer (REB, restriction enzyme buffer = 50 mM NaCI, 10 mM Tris pH 
7.5, 20 mM MgCI2, 10 mM BME), to which 1.5-200 U of enzyme were added as followed: 1.5 U Avrll, 9 U Eael, 
10 U Nhel, 20 U Notl, 9 U Spel, 20 U Xhol, 30 U Xbal, 20 U Sad, 10 U BamHI, 20 U Clal, 20 U Hindlll, 20 U 
Pstl, 12 U Nael, 10 U Seal, 12 U Xmnl, 20 U EcoRV, 20 U Sal, 20 U Eco Rl, 200 U Eagl, 20 U Dral, 5 U Hapl, 

55 8 U Nrut, 4 U SnaBI, 8 U Stul, 10 U Bell, 8 U Bglll, 10 U Rsal, 10 U Haelll, 8 U Alul, 4 U Hindi, 10 U Pvull, 6 
U Sspl. One ul 10 mg/ml BSA was added to the Hindi digest. Ball digest was prepared as above except there 
was 0 mM NaCI in the buffer. All digest were overnight at 37°C except Bell which was incubated at50°C. Digests 
were electrophoresed on agarose gels and transferred to NC (Southern, J. Mot. BioL (1975) 98:503-517). The 
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filters were probed with radiolabeled V27-5.4 DNAand hybridization was detected by autoradiography. In. most 
digests, V27-5.4 DNA hybridized to fragments greater than 20 kb, except BamHI (approximately 14 kb) f Eco 
Rl (1 .3 kb), Hindlll (approximately 2.4, 5.4 kb), Xbal (approximately 8 kb), Clal (approximately 4.4, 5.5 kb), Ball 
(approximately 8.5 kb), Hindi (approximately 2.1 , approximately 2.4 kb), Nrul (approximately 5.5 kb), Bglll (ap- 
5 proximately 2.9 kb), Haelll (approximately ,1.3, approximately 1.4 kb) and Rsal which gave numerous small 
bands. 

Digests yielding single fragments large enough to encode the entire polymerase gene, estimated to be 2.4- 
3 kb, based on the size of the native protein, were BamHI, Xbal, and Nrul. 

10 BamHI Library 

A BamHI genomic library was constructed using lambda Dashll. Lambda Dashll is a BamHI substitution 
vector that can be used to clone 10-20 kb BamHI DNA fragments. 25-75 nanograms of T. litoralis genomic DNA 
digested with BamHI, as described above, was ligated to 0.5 \ig BamHI digested, calf intestine phosphatase 

15 treated lambda Dashll DNA in five ul of standard ligation buffer including 0.5 ul T4 DNAIigase (New England 
Biolabs No. 202). Three ul of the ligation reaction was packaged (Gigapack Plus, Stratagene) as described 
above. Plaque lifts of 8,000 plaques from the lambda Dashll library were probed with labeled gel purified 1.3 
kb Eco Rl fragment from clone V27-5.4 (Maniatis, et al., supra). 2.5% of the phage hybridized to the 1.3 kb 
Eco Rl DNA fragment, two of which were plaque purified (clones lambda NEB 619 and lambda V56-9). Both 

20 phage contained a 12-15 kb BamHI fragment which hybridized to the 1.3 kb Eco Rl fragment and contained 
the approximately 8 kb Xbal and approximately 5.5 kb Nrul fragments. The BamHI insert was subcloned into 
pBR322. Colonies containing this fragment grew very poorly and, based on the polymerase assay described 
above, failed to produce detectable levels of thermostable DNA polymerase. 

25 Xbal Library 

T. litoralis DNA digested with Xbal was cloned into the Xbal site of pUC19. Colony lifts were probed with 
radiolabeled V27-5.4 DNA. No positive clones were detected. 

The Xbal fragment from the BamHI insert in lambda NEB 619 (BamHI library above) was subcloned into 

30 the Xbal site of pUC1 9. Approximately 0.3 \ig of NEB 619 DNAdigested with BamHI was ligated to 0,1 ugpUC19 
DNA digested with BamHI using two uJ T4 DNAIigase (New England Biolabs No. 202) in 20 jil of standard lig- 
ation buffer. The ligation was incubated overnight at 16°C. CaCI 2 competent JM101 and XL-1 cells were trans- 
formed with five ^l of ligation mix and incubated overnight at 37°C (Maniatis, et al., supra). Colony lifts were 
probed with radiolabeled purified 1.3 kb Eco Rl fragment from V27-5.4 DNA. No positives were detected. Com- 

35 petent RRI cells were transformed with 10 uJ of ligation mix and incubated overnight at 30°C. Micro-colonies 
were picked and mini-plasmid preparations (boiling method, Maniatis, et al., supra) analyzed. Most of these 
clones contained the approximately 8 kb Xbal fragment. The rationale for this latter experiment was that since 
the BamHI clones grew poorly, there would be an increased chance of isolating a plasmid containing the T. 
litoralis polymerase gene from an Xbal colony that also grew slowly. Also, lower temperature of incubation re- 

40 suits in less copies of pUC1 9 plasmids per cell. These results provided evidence that the T. litoralis polymerase 
gene was toxic to E. coli. Using the polymerase activity assay described above, no thermostable polymerase 
activity was detected in these clones. Restriction analysis indicated that the Xbal clones should contain the 
entire polymerase gene. See Figure No. 2. 

45 - Nrul Libraries 

Approximately 0.3 ug of NEB 619 DNA (BamHI library above) cut with Nrul was ligated to 0.1 ug of pUC19 
DNA cut with Hindi exactly as described for the Xbal library. Again, no positives were found by hybridization 
when cells were incubated at 37°C, but when transformants were incubated at 30°C, many micro-colonies were 

so observed. The majority of these micro-colonies contained the approximately 5.5 kb Nrul insert. Using the poly- 
merase activity assay described above, no thermostable polymerase activity was detected in these colonies. 
Analysis of these colonies determined that when the direction of T. Litoralis polymerase transcription was the 
same as lacZ in pUC19, the colonies failed to grow at 37°C and were extremely unstable. 
However, colonies in which the direction of T. litoralis polymerase transcription was opposite of lacZ in pUC1 9, 

55 such as in clone Nru21, were more stable. This indicated that transcription of T. litoralis polymerase is detri- 
mental to E. coli, and may explain why it was so difficult to done the entire gene. Restriction mapping analysis 
indicated that the Nrul dones should contain the entire polymerase gene. See Figure No. 2. 
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Conclusions Concerning Direct Cloning of the Polymerase 

The T. litoralis is approximately 90-95 kDal which would require approximately 2.4-3.0 kb DNAto encode 
the entire gene. Restriction mapping analysis of the 1.3 kb Eco Rl fragment, coding for the ami no-term in us of 
5 the T. litoralis polymerase gene, found within the BamHI, Xbal and Nrul clones, discussed above, indicates that 
all three clones contain the entire polymerase gene. All of these larger clones were unstable in E. coli. Therefore, 
alternate methods, as discussed below, for cloning the polymerase were tested. 

D. CLONING THE SECOND HALF OF T. LITORALIS POLYMERASE GENE 

10 

It is believed that when the entire T. litoralis polymerase gene was cloned in E. coli while under its endo- 
genous control, mutations in the gene arose. To prevent selection of inactive mutants, the polymerase gene 
was cloned from the T. litoralis genome in 2 or more pieces which should each separably be inactive and there- 
fore not selected against. Restriction mapping of the T. litoralis genome was used to determine which restriction 

15 enzymes would produce fragments that would be appropriate for cloning the second half of the T. litoralis poly- 
merase gene. Although the above data indicates that expression of T. litoralis polymerase was toxic for E. coli, 
it was also possible that DNA sequences themselves, in or outside of the coding region, were toxic. Therefore, 
the minimum sized fragment which could encode the entire gene was determined to be the best choice. Re- 
striction analysis indicated that there was an approximately 1.6 kb Eco Rl fragment adjacent to the 3' end of 

20 the amino terminal 1 .3 kb Eco Rl fragment (see Figure No. 2) which could possibly complete the polymerase 
gene. 

Hybridization Probe for the Second Half of the T. litoralis DNA Polymerase Gene 

25 Since none of the previous clones expressed thermostable polymerase activity, it was possible that they 

had accumulated mutations in the coding sequence and would therefore not be suitable sources of the second 
half of the gene. Hybridization probes were therefore required in order to clone the downstream fragments from 
the genome. The approximately 3.2 kb Ndel/Clal fragment from clone Nru21 (the Nru21 clone contains an ap- 
proximately 5.5 kb insert, beginning approximately 300 bp upstream from the start of the polymerase gene) 

30 was subcloned into pSP73 (Promega) creating clone NCII. CaCI 2 competent RRI cells were transformed, as 
above, with the ligation mixture. Mini-plasmid preps of transformants were analyzed by digestion with Ndel and 
Clal and clone NCII containing the T. litoralis 3.2 kb Ndel/Clal fragment was identified. This clone was stable 
in E. coli. The NCII insert was sequenced (Sanger, et al., PNAS, (1977) 74:5463-5467). The Clal end was iden- 
tical to the V27-5.4 sequence (1 .3 kb Eco Rl fragment coding for the amino-terminus of the T. litoralis polymer- 

35 ase). The 1.3 kb Eco Rl junction and beyond was sequenced using primers derived from the 1.3 kb Eco Rl 
fragment sequence. The Ndel end was sequenced from primers within the vector. 

Screening of Eco Rl Genomic Libraries 

40 10 ug of NC11 were digested with 30 U of Eco Rl in 100 \i\ of Eco Rl buffer at 37°C for two hours. The 

approximately 1.6 kb Eco Rl fragment was purified on DE-81 paper (Whatman) after electrophoresis. The ap- 
proximately 1.6 kb Eco Rl fragment was radiolabeled and used to probe the original Eco Rl lambda gtll library. 
Infection and plaque lifts were performed as above. Three positives were identified and plaque purified. All con- 
tain the approximately 1.6 kb Eco Rl fragment, but some also contain other inserts. 

45 - An Eco Rl library was also constructed in lambda Zapll. 2 ug of T. litoralis DNA were digested with 20 U 
Eco Rl for five hours at 37 g C in 20u.l Eco Rl buffer and then heat treated at 65°C for 15 minutes. Approximately 
15 nanograms of T. litoralis DNA/Eco Rl was ligated to 0.5 u>g of Eco Rl cut, phosphatased lambda Zapll DNA 
(Stratagene) with 0.5 uJ T4DNA ligase (New England Biolabs No. 202) in 5 uJ of ligation buffer at 1 6°C overnight. 
4 uJ of ligated DNA was packaged (GigaPack Gold, Stratagene). Infection and plaque lifts were performed as 

50 above. Approximately 1,500 phage were probed with radiolabeled approximately 1.6 kb Eco Rl fragment as 
above. Five hybridization positive plaques were picked and three were plaque purified. Two phage (NEB 620 
and V109-2) were rescued as pBluescript recombinants (V117-1 and V117-2) by in-vivo excision according to 
the manufacturer's instructions (Stratagene). Both contained the approximately 1 .6 kb Eco Rl fragment plus 
different second fragments. The 5' end was sequenced and corresponds to the sequence determined from 

55 NC11 (Clal/Ndel fragment). See Figure No. 2. This Eco Rl fragment contains 3/6 of the T4 DNA polymerase 
family homology islands as described by Wang, et al., supra. The 1 .6 kb Eco Rl fragment comprises nucleotides 
1269 to 2856 of Figure No. 6. 

The sequence of the 1 .6 kb Eco Rl and Clal/Ndel fragments indicated that the 1 .9 kb Eco Rl fragment may 
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be necessary to complete the polymerase gene. Lambda Zapll phage, V110-1 through V110-7, containing the 
1 .9 kb Eco Rl fragment were identified as described above for NEB 620 using labeled probes. Two phage (V1 1 0- 
2 and V110-4) were rescued as pBluescript recombinants (V153-2 and V153-4) by in-vivo excision according 
to the manufacturers instructions (Stratagene). Both contained the approximately 1.9 kb Eco Rl fragment plus 

5 different second fragments. The 1.9 kb Eco Rl fragment had sequence identity with the overlapping region in 
NC11. The 1 .9 kb Eco Rl fragment comprises nucleotides 2851 to 4771 of Figure No. 6. 

The entire T. litoralis polymerase gene has been cloned as BamHI, Xbal and Nrul fragments which were 
unstable and from which the active enzyme was not detected. The gene has also been cloned in four pieces 
(1 .3 kb Eco Rl fragment, approximately 1 .6 kb Eco Rl fragment, approximately 1 .9 kb Eco Rl fragment and an 

10 Eco RI/BamHI fragment containing the stop codon). The 1 .3 kb Eco Rl fragment stably expresses the amino 
terminal portion of the polymerase. 

EXAMPLE III 

15 CLONING OF ACTIVE T. LITORALIS DNA POLYMERASE 

The T. litoralis polymerase gene found on the 14 kb BamHI restriction fragment of bacteriophage NEB619 
(ATCC No. 40795), was sequenced using the method of Sanger, et al. t PNAS (1977) 74:5463-5467. 5837 bp 
of continuous DNA sequence (SEQ ID NO:1) was determined beginning from the 5' end of the 1.3 kb EcoRI 

20 fragment (position NT 1), see Figure No. 6. 

From analysis of the DNA sequence, it was determined that the polymerase gene begins at NT 291 in the 
1 ,3 kb EcoRI fragment. A translation termination site beginning at NT 5397 was also located. Since the apparent 
molecular weight of T. litoralis polymerase was approximately 90-95 Kdal, it was predicted that the gene should 
be -2900 bp. Instead, a 5 1 06 bp open reading frame (ORF) was identified with a coding capacity of 1 702 amino 

25 acids (aa) or -185 Kdal. 

By sequence homology with other DNA polymerases, an example of which is set out in Figure No. 7, it was 
discovered that the T.-litoralis polymerase gene was interrupted by an intron or intervening sequence in DNA 
polymerase consensus homology region III (hereinafter "I VSI") (Wang, T., et al., FASEB Journal { 1984) 3:14- 
21 the disclosure of which is herein incorporated by reference). The conserved amino acids of the consensus 

30 DNA polymerase homology region III are shown in Figure No. 7. In the Figure, the conserved amino acids are 
underlined. As can be seen in Figure No. 7, the left side of the T. litoralis homology island III (SEQ ID NO:2) 
begins at NT 1737, and homology to the consensus sequence is lost after the Asn and Ser residues. The right 
side of the T. litoralis homology island III (SEQ ID NO:3) can be picked up at NT 3384, at the Asn and Ser re- 
sidues. When the two T. litoralis polymerase amino acid sequences were positioned so that the Asn and Ser 

35 residues overlap, as in Figure No. 7, it was evident that a good match to the DNA polymerase homology region 
III existed. 

Using the homology data, it was therefore predicted that an intervening sequence existed in the T. litoralis 
DNA separating the left and right halves of the DNA polymerase homology region 111. 

In one preferred embodiment, the intervening sequence was deleted by identifying unique restriction en- 
40 zyme sites in the coding region which were near the intervening sequence splice junction. A synthetic duplex 
oligonucleotide was synthesized, and used to bridge the gap between the two restriction fragments. A multi- 
part sequential ligation of the carboxy end restriction fragments, the bridging oligonucleotide, the amino end 
restriction fragment, and the expression vector, resulted in the formation of an expression vector containing an 
intact polymerase gene with the intervening sequence deleted. 
45 ' Specifically, the DNA fragments or sequences used to construct the expression vector of the present in- 
vention containing the T. litoralis DNA polymerase gene with the intervening sequence deleted were as follows: 
1 . An Ndel site was created by oligonucleotide directed mutagenesis (Kunkel, et al. t Methods in Enzomol- 
ogy (1987) 154:367:382) in plasmid V27-5.4 (Example II, Part B) such that the initiation codon of the poly- 
merase coding region is contained within the Ndel site. 

50 

Original sequence . . . TTT ATG . . . 

(nucleotides 23S-Z93) 

55 New sequence - . • CAT ATG . . . 

Sequences from the newly created Ndel site to the Clal site (approximately 528 base pairs) were util- 
ized in the construction of the expression vector. 
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2. An approximately 899 bp sequence between the Clal and Pvui site of NC11 (Example II, Part D). 

3. Asynthetic duplex which spans the intervening sequence, connecting Pvul and Bsu36l sites derived from 
other fragments, as set out in Figure No. 12. 

In Figure No. 12, the first line indicates the original sequence at the 5' end of the splice junction (nu- 
5 cteotides 1721-1784, SEQ ID NO:1), the second line indicates the original sequence of the 3' end of the 

splice junction (nucleotides 3375-3415, SEQ ID NO:1), and the third and fourth lines indicate the sequence 
of the synthetic duplex oligonucleotide. 

4. A Bsu361 to BamHI fragment, approximately 2500 base pairs, derived from bacteriophage NEB 619 (Ex- 
ample II, Part C). 

10 5. A BamHI to Ndel fragment of approximately 6200 base pairs representing the vector backbone, derived 

from pET11c (Studier, Methods in Enzomology, (1990) 185:66-89), and which includes: 

a) The T7 phi 10 promoter and ribosome binding site for the gene 10 protein 

b) Ampicillin resistance gene 

c) lad 1 * gene 

15 d) Plasmid origin of replication 

e) Afour-fold repeat of the ribosomal transcription terminators (rrnb), Simons, etal., Gene (1987) 53:85- 
96. 

The above DNA fragments, 1-5, were sequentially ligated under appropriate conditions using T4 DNAIig- 
ase. The correct construct was identified by restriction analysis and named pPR969. See Figure No. 8. pPR969 
20 was used to transform E. coli strain RRI, creating a strain designated NEB 687. A sample of NEB 687 was de- 
posited with the American Type Culture Collection on December 7, 1990 and bears ATCC No. 68487. 

In another preferred embodiment, the 7: Htoralis polymerase gene, with the intervening sequence deleted, 
was cloned into a derivative of the Studier T7 RNA polymerase expression vector pET11c (Studier, (1990) su- 
pra). The recombinant plasmid V174-1B1 was used to transform £ coli strain BL21(DE3)pLysS, creating strain 
25 175-1B1, designated NEB671. See Figure Nos. 5 and 10. 

A sample of NEB671 was deposited with the American Type Culture Collection on October 17, 1990 and 
bears ATCC No. 68447. 

A comparison between the predicted and observed molecular weights of the polymerase, even with the 
IVS1 deleted, revealed a discrepancy. The predicted molecular weight of the polymerase after removal of IVS1 

30 in region 111 is 132 Kb, while the observed molecular weight of either the native (see Example I) or recombinant 
(see Example IV) polymerase is about 95 kD. The molecular weight discrepancy is due to an intron (hereinafter 
,, IVS2 W ) in homology region I. This finding is based on the following observations: The distance between hom- 
ology regions 111 and I varies from 15-135 amino acids in members of the pol alpha family (Wang, (1989) supra). 
In 7". Htoralis there are 407 amino acids or -44-kD separating these regions. T. Htoralis DNA polymerase is very 

35 similar to human pol alpha except for 360 amino acids between conserved homology regions I and III where 
no similarity exists. Finally, no consensus region I is observed. 

In addition, as determined by SDS-PAGE, a thermostable endonuclease of approximately 42-47 kD is also 
produced by the T. Htoralis DNA polymerase clones of the present invention (see Example X). This endonu- 
clease was purified to homogeneity by standard ion exchange chromatography, and was sequenced at its ami- 

40 no-terminal. The first 30 amino acids of the endonuclease correspond to the amino acids encoded beginning 
at nucleotide 3534 of the polymerase clone (SEQ ID NO:1). This corresponds to the portion of the polymerase 
which lacks homology with other known polymerases This endonuclease does not react with anti-T Htoralis 
DNA polymerase antisera. While the exact mechanism by which the endonuclease is spliced out of the poly- 
merase is unknown, it occurs spontaneously in both £ coli and T. Htoralis. 

45 ' ' 

EXAMPLE IV 

PURIFICATION OF RECOMBINANT T. HTORALIS DNA POLYMERASE 

50 £ coli NEB671 (ATCC No. 68447) was grown in a 100 liter fermentor in media containing 10 g/liter tryptone, 
5 g/liter yeast extract, 5 g/liter NaCI and 100 mg/liter ampicillin at 35°C and induced with 0.3 mM IPTG at mid- 
exponential growth phase and incubated an additional 4 hours. The cells were harvested by centrifugation and 
stored at -70°C. 

580 grams of cells were thawed and suspended in Buffer A (100 mM NaCI, 25 mM KP0 4 at pH 7.0, 0.1 
55 mM EDTA, 0.05% Triton X-100 and 1 0% glycerol) to a total volume of 2400 ml. The cells were lysed by passage 
through a Gaulin homogenizer. The crude extract was clarified by centrifugation. The clarified crude extract 
volume was adjusted to 2200 mis with the above buffer and was heated to 75°C for 30 minutes. The particulate 
material was removed by centrifugation and the remaining supernatant contained about 3120 mg of soluble 
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protein. 

The supernatant was applied to a DEAE-sepharose column (5 x 13 cm; 255 ml bed volume) linked in series 
to a phosphocellulose column (5 x 11 cm; 216 ml bed volume). The DEAE-sepharose flow-through fraction, 
containing the bulk of the enzyme, passed immediately onto the phosphocellulose column. Both columns were 
5 washed with 300 mis Buffer A, the two columns were disconnected, and the protein on the phosphocellulose 
. column was eluted with a 2 liter linear gradient of NaCI from 0.1 M to 1 M formed in Buffer A. 

The column fractions were assayed for DNA polymerase activity. Briefly, 1-4 uJ of fractions were incubated 
for 5-10 minutes at 75°C in 50 ul of 1X T. litoralis DNA polymerase buffer (10 mM KCI, 20 mM Tris-HCl (ph 8.8 
at24°C), 10 mM (NH 4 ) 2 S0 4 , 2 mM MgS0 4 and 0.1% Triton X-100) containing 30 uM each dNTP and 3 H-labeled 
10 TTP, 0.2 mg/ml activated calf thymus DNA and 100 ug/ml acetylated BSA, although it has been found that non- 
acetylated BSA is preferred. The mixtures were applied to Whatman 3 mm filters and the filters were subjected 
to three washes of 10% TCA followed by.two washes of cold isopropanol. After drying of the filters, bound ra- 
dioactivity representing incorporation of 3 H-TTP into the DNA was measured. The active fractions were pooled 
and^he enzyme activity levels in each pool were assessed using the above assay conditions except the dNTP 
15 level was raised to 200 uM each dNTP. Under these conditions one unit of enzyme activity was defined as the 
amount of enzyme that wilt incorporate 10 nmoles of dNTP into acid-insoluble material at 75°C in 30 minutes. 

The active fractions comprising a 300 ml volume containing 66 mg protein, were applied to a hydroxylapatite 
column (2.5 x 5 cm; 25 ml bed volume) equilibrated with Buffer B (400 mM NaCI, 10 mM KP0 4 at pH 7.0, 0.1 
mM EDTA, 0.05% Triton X-1 00 and 1 0% glycerol). The protein was eluted with a 250 m! linear gradient of KP0 4 
20 from 10 mM to 500 mM formed in Buffer B. The active fractions, comprising a 59 ml volume containing 27 mg 
protein, was pooled and dialyzed against Buffer C (200 mM NaCI, 10 mM Tris-HCl at pH 7.5, 0.1 mM EDTA, 
0.05% Triton X-100 and 10% glycerol). 

The dialysate was applied to a heparin-sepharose column (1.4 x 4 cm; 6 ml bed volume) and washed with 
20 ml Buffer C. A 100 ml linear gradient of NaCI from 200 mM to 700 mM formed in Buffer C was applied to the 
25 column. The active fractions, comprising a 40 ml volume containing 16 mg protein was pooled and dialyzed 
against Buffer C. 

The dialysate was applied to an Affi-gel Blue chromatography column (1 A x 4 cm; 6 ml bed volume), wash- 
ed with 20 ml Buffer C, and the protein was eluted with a 95 ml linear gradient from 0.2 M to 2 M NaCI formed 
in Buffer C. The active fractions, comprising a 30 ml volume containing 11 mg of protein, was dialyzed against 
30 a storage buffer containing 200 mM KCI, 10 mM Tris-HCl (pH 7.4), 1 mM DTT, 0.1 mM EDTA, 0.1% Triton X- 
100, 100 ug/ml BSA and 50% glycerol. 

The T. litoralis DNA polymerase obtained above had a specific activity of 20,000-40,000 units/mg. 

Characterization of recombinant T. litoralis polymerase 

35 

Recombinant and native T. litoralis polymerase had the same apparent molecular weight when electrophor- 
esed in 5- 10% SDS-PAGE gradient gels. Recombinant T. litoralis polymerase maintains the heat stability of 
the native enzyme. Recombinant T. litoralis polymerase has the same 3' — >5' exonuclease activity as native 
T. litoralis polymerase, which is also sensitive to inhibition by dNTPs. 

40 

EXAMPLE V 

OVER-EXPRESSION OF THE THERMOCOCCUS LITORALIS DNA POLYMERASE GENE 

45 < The 7: litoralis DNA polymerase gene, with IVS1 deleted, e.g., V174-1B1 obtained in Example III, may be used 
in a number of approaches, or combinations thereof, to obtain maximum expression of the cloned T. litoralis 
DNA polymerase. 

One such approach comprises separating the T. litoralis DNA polymerase gene from its endogenous control 
elements and then operably linking the polymerase gene to a very tightly controlled promoter such as a T7 ex- 

50 pression vector (Rosenberg, et al., Gene (1987) 56:125-1 35). Insertion of the strong promoter may be accom- 
plished by identifiying convenient restriction targets near both ends of the T. litoralis DNA polymerase gene and 
compatible restriction targets on the vector near the promoter, or generating restriction targets using site di- 
rected mutagenesis {Kunkel, (1984), supra), and transferring the T. litoralis DNA polymerase gene into the vec- 
tor in such an orientation as to be under transcriptional and translational control of the strong promoter. 

55 T. litoralis DNA polymerase may also be overexpressed by utilizing a strong ribosome binding site placed 

upstream of the T. litoralis DNA polymerase gene to increase expression of the gene. See, Shine and Dalgamo, 
Proc. Natl. Acad. Sci. USA (1974) 71:1342-1346, which is hereby incorporated by reference. 

Another approach for increasing expression of the T. litoralis DNA polymerase gene comprises altering the 
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DNA sequence of the gene by site directed mutagenesis or resynthesis to contain initiation codons that are 
more efficiently utilized than E. coli 

Finally, 7". litoralis DNA polymerase may be more stable in eukaryote systems like yeast and Baculovirus. 
The £ IHoralis DNA polymerase may be produced from clones carrying the T. litoralis DNA polymerase gene 
5 by propagation in a fermentor in a rich medium containing appropriate antibiotics. Cells are thereafter harvested 
by centrifugation and disrupted by sonication to produce a crude cell extract containing the T. litoralis DNA poly- 
merase activity. 

The crude extract containing the 7". litoralis DNA polymerase activity .is purified by the method described 
in Example I, or by standard product purification techniques such as affinity-chromatography, or ion-exchange 
10 chromatography. 

EXAMPLE VI 

PRODUCTION OF A T. LITORALIS DNA POLYMERASE 3' TO 5' EXONUCLEASE MUTANT 

15 

T. litoralis DNA polymerase lacking 3' to 5' exonuclease activity was constructed using site-directed mu- 
tagenesis to alter the codons for asp141 and glu143 to code for alanine. Site-directed mutagenesis has been , 
used to create DNA polymerase variants which are reported to have reduced exonuclease activity, including 
phi29 (Cell (1989) 59:219-228) DNA polymerase I (Science (1988) 240:199-201) and T7 DNA polymerases 
20 (U.S. Patent No. 4,942,130). 

Site-directed mutagenesis of the polymerase of the present invention was accomplished using a modifi- 
cation of the technique described by Kunkel, T.A., PNAS (1985) 82:488-492, the disclosure of which is herein 
incorporated by reference. The V27-5.4 plasmid (see Example 2, Part B) was used to construct the site-directed 
mutants. V27-5.4 encodes the 1 .3 kb EcoRI fragment in pBluescript Sk+. E. cofi strain CJ236 (Kunkel, et al., 
25 Methods in Enymology (1987) 154:367-382), a strain that incorporates deoxyuracil in place of deoxythymidine, 
containing the V27-5.4 plasmid was superinfected with the f1 helper phage IR1 (Virology, (1982) 122:222-226) 
to produce single stranded versions of the plasmid. 

Briefly, the side-directed mutants were constructed using the following approach. First, a mutant oligonu- 
cleotide primer, 35 bases in length, was synthesized using standard procedures. The oligonucleotide was hy- 
30 bridized to the single-stranded template. After hybridization the oligonucleotide was extended using T4 DNA 
polymerase. The resulting double-stranded DNA was converted to a closed circular dsDNA by treatment with 
T4 DNA ligase. Plasmids containing the sought after mutations were identified by virtue of the creation of a 
Pvul site overlapping the changed bases, as set out below. One such plasmid was identified and named pAJG2. 
The original and revised sequences for amino acid residues are 141, 142, and 143: 
35 ... asp i 1 e glu 

Original : . . GAT ATT GAA 

. . al a ile al a 

40 Altered: . . G CG ATC G CA 

The newly created Pvul site, used to screen for the alteration, is underlined. Note that the middle codon was 
changed but that the amino acid encoded by this new codon is the same as the previous one. 

An approximately 120 bp Clal to Ncol fragment from V174-1B1 (see Example III) was replaced by the cor- 
45 - responding fragment bearing the above substitutions from pAJG2, creating pCAS4 (see Figure No. 9). pCAS4 
thus differs from V174-1 B1 by 4 base pairs, namely those described above. 

E. coli BL21 (DE3) plysS (Methods in Enzomology, (1 990) 1 85:60-89) was transformed with pCAS4, cre- 
ating strain NEB681. Expression of the mutant T. litoralis polymerase was induced by addition of IPTG. 

Asample of NEB681 has been deposited with the American Type Culture Collection on November 8, 1990, 
so and bears ATCC No. 68473. 

Relative exonuclease activities in the native T. litoralis DNA polymerase and the exonuclease minus variant 
isolated from E. coli NEB681 was determined using a uniformly PH] labeled E. coli DNA substrate. Wild type 
T. IHoralis DNA polymerase was from a highly purified lot currently sold by New England Biolabs, Inc. The ex- 
onuclease minus variant was partially purified through DEAE sepharose and phosphocellulose columns to re- 
55 move contaminants which interfered with the exonuclease assays. The indicated number of units of POLYMER- 
ASE were added to a 0.1 ml reaction containing T. litoralis DNA polymerase buffer [20 mM Tris-Hcl (pH8.8 at 
25°C), 10 mM KCI, 10 mM (NH 4 ) 2 S0 4 , 5 mM MgS0 4 , 0.1% Triton X-100], 0.1 mg/ml bovine serum albumin, 
and 3 fig/ml DNA substrate (specific activity 200,000 cpm/ng) and the reaction was overlaid with mineral oil to 
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prevent evaporation of the reaction. Identical reactions contained in addition 20 \xM dNTP, previously shown 
to inhibit the exonuclease activity of the wild type enzyme. The complete reaction mixture was incubated at 
70°C for 60 minutes, following which 0.08 ml was removed and mixed with 0.02 ml 0.5 mg/ml sonicated herring 
sperm DNA (to aid in precipitation of intact DNA) and 0.2 ml of 10% trichloroacetic acid at 4°C. After mixing, 

5 the reaction was incubated on ice for 5 minutes, and the DNA then pelleted at4°C for 5 minutes in an Eppendorf 
centrifuge. 0.25 ml of supernatant was mixed with scintillation fluid and counted. The results of the sample 
counting, corrected forbackground, are shown in Figure No. 11. 

As illustrated in Figure No. 11 , the exonuclease minus variant was substantially free of exonuclease activity 
in the presence or absence of dNTPs under conditions where the native polymerase clearly demonstrated ex- 

10 onuclease activity. Conservatively estimating that a level of activity two-fold above background could have been 
detected, this implies that the exonuclease activity is decreased at least 60-fold in this variant 

EXAMPLE VII 

15 T. UTORALIS DNA POLYMERASE HALF-LIFE DETERMINATION 

The thermostability or half-life of the T. litoratis DNA polymerase purified as described above in Example 
1 was determined by the following method. Purified T. litoralis DNA polymerase (25 units) was preincubated at 
100°C in the following buffer 70 mM tris-HCI (pH 8.8 at 25°C), 17 mM ammonium sulfate, 7 mM MgCI 2 , 10 mM 

20 beta-mercaptoethanol, 200nM each deoxynucleotide and 200 ug/ml DNAse-treated DNA. An initial sample was 
taken at time zero and a small aliquot equivalent to 5% of the enzyme mixture was removed at 10, 20, 40, 60, 
90, 120, 150, and 180 minutes. The polymerase activity was measured by determining incorporation of deox- 
ynucleotide into DNA as described previously. 

A sample of Taq DNA polymerase obtained from New England Biolabs was subjected to the above assay. 

25 An initial sample was taken at time zero and a small aliquot equivalent to 5% of the enzyme mixture was re- 
moved at 4, 7, and 10 minutes. As shown in the Figure No. 3, the half-life of the T. litoralis DNA polymerase at 
100°C was 60 minutes, while the half-life of the Taq polymerase at 100°C was 4.5 minutes. 

As shown in Figure No. 3A, the half-life of T. litoralis DNA polymerase at 1 00°C in the absence of stabilizers 
was 60 minutes, while in the presence of the stabilizers TRITON X-100 (0.15%) or BSA (100ug/ml) the half- 

30 life was 95 minutes. This was in stark contrast to the half-life of Taq DNA polymerases at 100°C r which in the 
presence or absence of stabilizers was 4.5 minutes. 

The thermostability or half-life of recombinant T. litoralis DNA polymerase purified as describe above in Ex- 
ample IV was found to have a biphasic heat inactivation curve at temperatures greater than about 90°C. These 
two phases were characterized by half-lives of about 5 minutes and 7 hours (Fig. 3B). To provide more con- 

35 sistent behavior at extreme temperatures, an additional purification step may be used to eliminate the more 
heat sensitive component of the polymerase. 

Specifically, the final enzyme preparation of Example IV was heated at 100°C for 15 minutes then cooled 
on ice for 30 minutes. Precipitated proteins were removed by centifugation at 12,000 xg for 10 minutes at 4°C. 
Approximately 20% of the initial polymerase activity was lost in this procedure. The remaining DNA polymerase 

40 showed a monophasic heat inactivation profile, with a half-life at 95°C of about 7 hours. The resulting polymer- 
. ase also showed kinetic characteristics at 75°C which were similar to the native enzyme and to the recombinant 
enzyme prepared in accordance with Example IV. 

EXAMPLE VIII 

45 ' 

DETERMINATION OF 3'-5' PROOFREADING ACTIVITY 

1. Response of T. litoralis DNA Polymerase to the Absence or Presence of Deoxynucleotides 

50 The levels of exonuclease activities associated with polymerases show very different responses to deox- 

ynucleotides. No n proofreading 5'-3* exonudeases are stimulated tenfold or greater by concomitant polymeri- 
zation afforded by the presence of deoxynucleotides, while proofreading 3'-5' exonudeases are inhibited com- 
pletely by concomitant polymerization, Lehman, I.R. ARB (1967) 36:645. 

The T. litoralis DNA polymerase or polymerases with well-characterized exonuclease functions (T4 Poly- 

55 merase, Klenow fragment) were incubated with 1 \ig 3 H-thumidine-labeled double-stranded DNA (1 0 5 CPM/^g) 
in polymerization buffer (70 mM tris (pH 8.8 at 24°C), 2 mM MgCI 2 , 0.1% Triton and 100 ug/ml bovine serum 
albumin). After an incubation period of three hours (experiment 1) or four hours (experiment 2) at either 70°C 
(thermophilic polymerases) or 37°C (mesophilic polymerases), the exonuclease-hydrolyzed bases were quan- 
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tified by measuring the acid-soluble radioactive! y-labeled bases. 

As shown in Table 1 , the Taq DNA polymerase, with its 5'-3' exonuclease activity, shows stimulation of ex- 
onuclease activity when deoxynucleotides were present at 30 uM. However, polymerases with 3' -5' proofread- 
ing exonuclease activities, such as theT4 polymerase, Klenow fragment of E. co// polymerase I, or the T. litoralis 
DNA polymerase showed the reverse, an inhibitory response to the presence of deoxynucleotides. 

The similarity of responses to the presence or absence of deoxynucleotides of the T. litoralis DNA poly- 
merase and the well-characterized Klenow fragment of the E. coti DNA polymerase is further shown in Figure 
No. 4. Twenty units of 
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either polymerase was incubated with 9 u.g 3 H-thymidine-labeled double-stranded DNA(10 5 CPM/ug) in 350 
\i\ polymerization buffer as described above in the presence, or absence of, 30 u.M deoxynucleotides. At each 
time point, 50 uJ was removed and the level of acid-soluble radioactrvely-fabeled bases were measured. As 
Figure No. 4 documents, the behavior of T. litoralis DNA polymerase and the Klenow fragment of E. coli DNA 
5 polymerase, which contains a well-characterized 3 '-5' proofreading exonuclease activity, are very similar. 

2. Response of T litoralis DNA Polymerase to Increasing Deoxy nucleotide Concentrations 

Exonuclease activities of polymerases are affected by the level of deoxynucleotides present during poly- 
10 merization, in as much as these levels affect polymerization. As deoxynucleotide levels are increased towards 
the Km (Michaelis constant) of the enzyme, the rate of polymerization is increased. For exonuclease functions 
of polymerases sensitive to the rate of polymerization, changes in exonuclease activity are parallel with increas- 
es in deoxynucleotide concentrations. The increase in polymerization rate drastically decreases proofreading 
3'-5' exonuclease activity with a concomitant increase in polymerization-dependent 5'-3' exonuclease activity. 
15 The exonuclease function of the T, litoralis DNA polymerase was compared to those of well-characterized 

exonuclease functions of other polymerases as the deoxynucleotide concentration was increased from 1 0 uM 
to 100 uM. The exonuclease activity was measured as described in (1) with an incubation period of 30 minutes. 
As summarized in Table 2, the T. litoralis DNA polymerase responded to increases in deoxynucleotide levels 
similarly to a polymerase known to possess a 3'-5* proofreading exonuclease (Klenow fragment of E. coli DNA 
20 Pol. I). This 
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response was in contradiction to that of a polymerase known not to possess this proofreading function, Taq 
DNA polymerase. This polymerase responded to an increase in deoxynucleotide levels with an increase in ex- 
onuclease function due to its 5'-3' exonuclease activity. 
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3. Response of T. litoralis DNA Polymerase to Alteration from a Balanced Deoxynucleotide State to an Un- 
balanced State 

Polymerization is dependent on equal levels of all four deoxy nucleotides present during DNA synthesis. If 
the deoxynucleotide levels are not equal, polymerases have decreased polymerization rates and are more likely 
to insert incorrect bases. Such conditions greatly increase proofreading 3-5' exonuclease activities while de- 
creasing 5-3' exonuclease activities. Lehman, I.R., ARB (1967) 36:645). 

The T. litoralis DNA polymerase was incubated with both balanced deoxynucleotide levels (30 uM) and two 
levels of imbalance characterized by dCTP present at 1/10 or 1/100 the level of the other three deoxynucleo- 
tides. The response of the T. litoralis DNA polymerase was then compared to that of three polymerases pos- 
sessing either the 3'-5* or the 5*-3' exonuclease functions. All assays were performed as described in (1) except 
the dCTP concentrations listed below. As seen in Table 3 below, the T. litoralis DNA polymerase follows the 
expected behavior for a proofreading 3-5' exonuclease-containing polymerase; an imbalance in deoxynucleo- 
tide pools increased the exonuclease activity in a similar manner as that of the proofreading polymerases of 
T4 DNA polymerase or Klenow fragment of £ colt DNA polymerase I. In contrast to this response, the exonu- 
clease of the Taq DNA polymerase was not affected until the imbalance was heightened to the point that poly- 
merization was inhibited. 
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4. Directionality of Exonuclease Activity 

A proofreading exonuclease has a 3'-5* directionality on DNA while non proofreading exonuclease associ- 
ated with DNA polymerases have a 5'-3' directionality. To discern the direction of the exonuclease activity of 

5 T. litoralis DNA polymerase, the 5* blocked DNA of adenovirus was utilized. Since the 5' end of this DNA is 
blocked by protein, enzymic activities that are 5'-3' in directionality cannot digest this double-stranded DNA; 
however, enzymic activities that are 3'-5\ such as exonuclease HI or proofreading exonuclease-containing poly- 
merases, can digest adenovirus DNA. 

Twenty-five units of exonuclease III or 20 units of either T. litoralis DNA polymerase, T4 DNA polymerase 

10 (possessing a well characterized 3'-5' exonuclease activity), or Taq DNA polymerase (lacking such an activity) 
were incubated with 5u.g adenovirus DNA for time periods up to 30 minutes duration at either 37°C (T4 poly- 
merase and exonuclease III) or 70°C (Taq polymerase and T. litoralis polymerase) in the presence of 70 mM 
tris-HCI pH 8.8 at 25°C, 2 mM MgCI 2 and 100 jig/ml BSA. At the end of each incubation time period, enzymic 
activity was stopped by phenol extraction of the adenovirus DNA, followed by Hpal digestion for one hour at 

15 37°C in 20 mM tris, pH 7.9 at 25°C, 10 mM Magnesium acetate 50 mM potassium acetate and 1 mM DTT. The 
DNA fragments were subjected to agarose gel electrophoresis and the resulting pattern of time-dependent deg- 
radation and subsequent loss of double-stranded DNA fragments were assessed. 

The 3'-5* exonuclease activities of exonuclease III, of T. litoralis DNA polymerase and T4 DNA polymerase 
caused the disappearance of the double-strand DNA fragments originating from the 5' blocked end of the ade- 

20 novirus DNA, indicating vulnerability of its 3' end. In contrast, the Taq DNA polymerase with its 5'-3' polymer- 
ization-dependent exonuclease activity, showed no disappearance of the DNA fragment. 

EXAMPLE IX 

25 PERFORMANCE OF T. litoralis DNA POLYMERASE IN THE PCR PROCESS 

The ability of the T. litoralis DNA polymerase to perform the polymerase chain reaction (PCR) was also 
examined. In 100 ui volumes containing the buffer described in Example IV, varying amounts of M13mp18 DNA 
cut by Clal digestion, generating 2 fragments of 4355 bp and 2895 bp, were incubated with 200 ng of calf thymus 

30 DNA present as carrier DNA to decrease any nonspecific adsorption effects. The forward and reverse primers 
were present at 1 nM (forward primer = 5'd (CC AG CAAG GC CG ATAGTTTG AGTT)3' and the reverse primer = 
5' d(CGCCAGGGTTTTCCCAGTCACGAC)3'). These primers flank a 1 kb DNA sequence on the 4355 bp frag- 
ment described above, with the sequence representing 14% of the total M13mp18 DNA. Also present were 200 
|iM each dNTP, 100 ng/ml BSA, 10% DM SO and 2.5 units of either T. aquaticus DNA polymerase (in the pres- 

35 ence or absence of 0.5% NP40 and 0.05% Tween 20), or 7: litoralis DNA polymerase (in the presence or ab- 
sence of 0.10% Triton X-100). The initial cycle consisted of 5 min at 95°C, 5 min at 50°C (during which poly- 
merase and BSA additions were made) and 5 min at 70°C. The segments of each subsequent PCR cycle were 
the following: 1 min at 93°C, 1 min at 50°C and 5 min at 70°C. After 0, 13, 23 and 40 cycles, 20 \i\ amounts of 
100 uJ volumes were removed and subjected to agarose gel electrophoresis with ethidium bromide present to 

40 quantitate the amplification of the 1 kb DNA sequence. 

Initial experiments with this target DNA sequence present at 28 ng and 2.8 ng established the ability of the 
7". litoralis DNA polymerase to catalyze the polymerase chain reaction; yields were comparable or not more than 
twofold greater than the seen with T. aquaticus DNA polymerase. 

However, it was at the lower levels of target DNA sequence, 2.8femtograms, that differences in polymerase 

45 ' function were most apparent Under these conditions requiring maximal polymerase stability and/or efficiency 
at elongation of DNA during each cycle, the T. litoralis DNA polymerase produced greater than fourfold more 
amplified DNA than that of T. aquaticus DNA polymerase within 23 cycles. 

This ability to amplify very small amounts of DNA with fewer cycles is important for many applications of 
PCR since employing large cycle numbers for amplification is associated with the generation of undesirable 

50 artifacts during the PCR process. 

EXAMPLE X 

PURIFICATION OF RECOMBINANT T. LITORALIS INTRON-ENCODED ENDONUCLEASE 

55 

E. coli NEB671 (ATCC No. 68447), grown as described in Example IV, were thawed (70 grams) and sus- 
pended in Buffer A containing 200 ng of lysozyme per ml to a final volume of 300 ml. The mixture was incubated 
at 37°C for 2 minutes and then 75°C for 30 minutes. The heated mixture was centrifuged at 22,00 x g for 30 
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minutes and the supernatant was collected for further purification of the thermostable endonuclease. Since all 
of the nucleases from E. coli were inactivated by the heat treatment, the preparation at this stage could be used 
for characterization of the intron-encoded endonuclease. To separate this enzyme from the recombinant T. It- 
toralis DNA polymerase also present in the 75°C supernatant solution, the solution was passed through a 

5 DEAE-sepharose column (5 cm x 5 cm, 100 ml bed volume) and washed with 200 ml of Buffer A. Essentially 
all of the DNA polymerase activity passes through the column while the endonuclease activity sticks. The en- 
donuclease activity was eluted with a one liter linear gradient of NaCI from 0.1 M to 0.8 M formed in Buffer A. 
The endonuclease activity eluted at about 0.4 M NaCI, and was assayed in a buffer containing 10 mM KCI, 20 
mM Tris-HCI (pH 8.8 at 24°C), 10 mM (NH4) 4 S0 4 , 10 mM MgS0 4 , 0.1% Triton X-100 and 1 ug of pBR322 DNA 

10 per 0.05 ml of reaction mixture. The reaction mixture was incubated at 75°C and the extent of DNA cleavage 
was determined by agarose gel electrophorese. At lower temperatures little or no endonuclease activity was 
detected. The tubes containing the peak activity were pooled, dialyzed overnight against Buffer A and then ap- 
plied to phosphocellulose column (2.5 cm x 6.5 cm, 32 ml bed volume), washed with Buffer A and the endonu- 
clease activity eluted with a linear gradient of Nad from 0.1 M to 1 .5 M formed in Buffer A. The enzyme eluted 

15 at about 0.8 M NaCI. Active fractions were pooled and dialyzed overnight against Buffer A and then passed 
through a HPLC Mono-S column (Pharmacia) and eluted with a linear gradient of NaCI from 0.05 M to 1.0 M. 
The activity eluted as a single peak and was homogeneous by SDS-PAGE: a single 42-47 kd band was detected 
by Commasie blue staining and when this band was eluted from the gel and renatured it contained the only 
endonuclease activity detected on the gel. 

20 The enzyme has preferred cutting sites on various DNAs. When used in vast excess and in Vent polymer- 

ase buffer (New England Biolabs, Beverly, MA), the enzyme has cutting sites on lambda DNA and 3 sites on 
pBR322. Two of the rapid sites on pBR322 have been sequenced: 
Region including cut site at position 164: 

25 

E» TTGGTTATGCCGGTAC TGCCGGCCTCTT 3' 
3' AACCAATACGGC CATGACGGCCGGAGAA 5' 

30 . 

Region including cut site at position 2411: 

5' TTGAGTGAGCTGATAC CGCTCGCCGCAG 3' 

35 

3» AACTCACTCGAC TATGGCGAGCGGCGTC 5' 

When IVS2 was deleted from pPR969, the resultant plasmid, pAKK4 (Example XI) now contains a very 
sensitive fast site at the exon junction: 
40 Region including the cut site at IVS2 junction: 

5' GGTTCTTTATGCGGAC*AC/TGACGGCTTTATG 3' 
3' CCAAGAAATACGCC/TG*TGACTGCCGAAATAC 5' 

45 ' 

The astericks denote the boundary between the left exon and the right exon which have been brought to- 
gether by deletion of IVS2. 

Cleavage at the l-TIi I homing site occurs 100-fold more rapidly that at the "star" sites using reaction con- 
ditions of 50 mM TRIS, (pH 7.9), 10 mM MgCI 2 , 100mM NaCI and 1 mM DTT at50°C. Under these conditions, 
so the enzyme cut E. coli DNA 6-10 times. "Star" cleavage is enhance by NH 4 (10 mM), higher temperatures (70- 
80°C), and higher pH (8.8-10). 

Thus, the endonuclease from T. IHoralis resembles other intron encoded endonucleases reported in that 
there is often a four base 3' extension at the cut site and there can be degeneracy in the recognition sequence. 

The cut site in the intron minus gene is referred to as the homing site of the intron encoded endonuclease. 
55 It is believed in the art that the intron encoded endonuclease recognizes its cut site in the gene lacking the 
intron, and that the cutting of that DNA by the endonuclease leads to insertion of the intron at the homing site. 

The thermostable endonuclease of the present invention can be used in genetic manipulation techniques 
where such activity is desired. 
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EXAMPLE XI 

Construction of T. litoralis DNA Polymerase Expression Vectors with a Deleted IVS2 

Analysis of the deduced amino acid sequence of the T. litoralis gene in comparison to other alpha class 
DNA polymerases and to the endonuclease in the 1170 bp intervening sequence suggested that this intron in- 
terrupted the alpha polymerase Region I. If the first 3 amino acids preceding the endonuclease (Tyr Ala Asp) 
were joined to the Thr at aa 1472, then a good consensus Region I would be established (where underlined 
residues indicate identity): 

Region I: TYR GLY AS? THR ASP ScR 

Left junction: TYR ALA ASP SER VAL SER 

Right junction: VAL HIS ASN IHR ASP GLY 

Vent Pol Region I: TYR ALA ASP THR AS? GLY 

To facilitate this construction, a Seal site was created in the PCR primers by changing the codon usage 
for Lys 1076 and Val 1077 as follows: 



Amino acids: 


PHE 


LYS 


VAL 


LEU TYR ALA. ASP 


Original sequence : 


TTT 


AAG 


GTT 


CTT 


Altered sequence: 


TTT 


AAA 


GTA 


CTT 


Sea I site: 




A 


GTA 


CT 



The expression plasmid pAKK4 was created in a three-way ligation derived from the following components: 

1) An about 7959 bp fragment of pPR969 was derived by cleavage with Hindlll and EcoRI. 9ug of pPR969 
DNA was incubated with 1X NEBuffer 2 in a total volume of 0.1 ml with 40 units of Hindlll endonuclease 
and 40 units of EcoRI endonuclease for 1 hour at 37° C. Cleavage products were separated on a 0.7% GTG 
grade agarose gel (FMC) run in Tris Borate EDTA buffer. The appropriate band, about 8 kbp, was isolated 
by electroelution using an Elutrap elution apparatus (Schleicher and Schuell) using the manufacturer's rec- 
ommended running conditions. Following elution, the fragment was concentrated by ethanol precipitation 
and the recovery quantified by comparison with known weight standards on agarose gel electrophoresis. 

2) An about 638 bp fragment with Seal and EcoRI termini derived from a PCR product. The reaction mixture 
contained 1 X NEB Vent Polymerase Buffer, 0.1 mg/ml bovine serum alumen, 0.2 mM dNTPs (equimolar, 
each nucleotide), 0.9 u.g/ml pV174-1B1 plasmid DNA template, and 0.01 A 26 oU/ml of primer 72-150 
(5'ATAAAGTACTTTAAAGCCGAACTTTTCCTCTA3') and primer "JACK" (5*CGGCGCATATGATACTGGA- 
CACTGATTAC3'). 0.1 ml of the reaction mix was placed into each of five tubes, and the samples heated 
to 95°C for 3-5 minutes in a Perki n- Elmer Thermocycler. 1 U of Vent DNA polymerase was added to each 
reaction tube, and 15 cycles were run on the thermocycler consisting of 94° C- 0.5 minutes, 50°C - 0.5 
minutes, and 72°C - 2 minutes. The samples were pooled, phenol extracted and ethanol precipitated. The 
sample was resuspended in 50 uJ Tris-EDTA buffer and mixed with 40 ^l of dH 2 0, 10 uJ of 10X NEBuffer 
3, 60 units of Seal endonuclease and 60 units of EcoRI endonuclease. After incubation at 37°C for 1 .75 h, 
the reaction products were separated on a 1 .5% agarose gel and the ca. 638 bp fragment was electroeluted, 
and quantified as described above. 

( 3) An about 358 bp fragment with Hindlll and Seal termini derived from a PCR product. The reaction mixture 
contained 1 X NEB Vent Polymerase Buffer, 0.1 mg/ml bovine serum albumin, 0.2 mM dNTPs (equimolar, 
each nucleotide), 0.9 u.g/mi pV174-1B1 plasmid DNA template, and 0.02 A 2 6o/ml of primer 698 (5'GA- 
GACTCGCGGAGAAACTTGGACT3') , and primer 73-143 (5'TACAGTACTTTATGCGGACACT- 
GACGGCTTTTATGCCAC3'). 0.1 ml of the reaction mix was placed into each of five tubes, and the samples 
heated to 95°C for 3-5 minutes in a Perkin-EImer Thermocycler. 1 U of Vent DNApolymerase was added 
to each reaction tube, and 20 cycles were run on the thermocycler consisting of 94°C - 0.5 minutes, 50°C 
- 0.5 minutes, and 72°C-1 minute. The samples were pooled, phenol extracted and ethanol precipitated. 
The sample was resuspended in 50 uJ Tris-EDTA buffer and cleaved with Hindlll and Seal endonucleases. 
The reaction products were separated on a 1.5% agarose gel and the 358 bp fragment was electroeluted, 
and quantified as described above. 

The ligation reaction contained approximately 1 ng/ml of the pPR969 fragment described above, 0.8 u<j/ml 
of the 638 bp fragment described above, 0.4 ^ig/ml of the 358 bp fragment described above, 1X NEB ligation 
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buffer and 1 00,000 units/ml T4 DNA ligase. Ligation occured at 16°C for 5 hours Correctly constructed recom- 
binants were identified by the Seal digestion pattern, and transformed into BL21(DE3) plysS to screen for in- 
ducible activity, as described above. Two such isolates, pAKK4 and pAKK15 were used in subsequent studies. 
These two isolates appear to be identical, although they were isolated from independent isolates. 

5 Expression from the new construct pAKK4 appears to yield 3-10-fold more active T. litoralis DNA polymer- 

ase than pPR969 without expression of the endonuclease from the 1170 bp intron. 

An expression vector for production of the exonuclease deficient variant of the T: litoralis polymerase was 
constructed by replacing a 1417 bp Clal-Sphl fragment from pAKK15 with an analogous 1417 bp fragmentfrom 
pCBA1, the original exonucl ease-deficient T. litoralis DNA polymerase construct. One such recombinant was 

10 named pAKM8 and was characterized further. 

EXAMPLE XII 

PURIFICATION OF A THERMOSTABLE DNA POLYMERASE FROM PYROCOCCUS SPECIES 

15 

Pyrococcus sp. strain GB-D (ATCC No. 55239) was grown in the media described by Belkin, et al., supra, 
containing 10 g/l of elemental sulfur in 8 one liter bottles at 94°C for two days. The cells were cooled to room 
temperature, separated from unused sulfur by decanting and collected by centrifugation and stored at -70°C. 
The yield of cells was 1 .4 g per liter. 

20 1 1 .5 g of cells obtained as described above, were suspended in 28 ml of buffer A (1 0 mM KP04 buffer, pH 

7.4; 0.1 mM EDTA, 1.0 mM beta-mercaptoethanol) containing 0.1 M NaCI and sonicated for 5 minutes at 4°C. 
The lysate was centrifuged at 15,000 g for 30 minutes at 4°C. The supernatant solution was passed through 
a 18 ml Affigel blue column (Biorad). The column was then washed with 50 ml of buffer A containing 0.1 M NaCI. 
The column was eluted with a 300 ml linear gradient from 0.1 to 2.0 M NaCI in buffer A. The DNA polymerase 

25 eluted as a single peak at approximately 1.3 M NaCI and represented 90% of the activity applied. The peak 
activity of DNA polymerase (25 ml) was dialyzed against 1 liter of buffer A containing 100 mM NaCI, and then 
applied to 15 ml Phosphocellulose column, equilibrated with buffer A containing 100 mM NaCI. The column was 
washed with 50 ml of buffer A containing 100 mM NaCI, and the enzyme activity was eluted with 200 ml linear 
gradient of 0.1 to 1 .0 M NaCI in buffer A. The activity eluted as a single peak at 0.6 M NaCI and represented 

30 70% of the activity applied. The pooled activity (42 ml) was dialyzed against 500 ml of buffer A and applied to 
a 25 ml DEAE column. The column was washed with 50 ml of buffer A containing 0.1 M NaCI, and two-thirds 
of the enzyme activity passed through the column. The active fractions were pooled (30 ml) and applied to an 
1.0 ml HPLC mono-S column (Pharmacia) and eluted with a 100 ml linear gradient in buffer A from 0.05 to 1.0 
M NaCI. The activity eluted as a single peak at 0.22 M NaCI and represented 80% of the activity applied. 

35 Purified Pyrococcus sp. polymerase was electrophoresed in SDS 10-20% polyacrylamide gel and stained 

with either Coomassie Blue or the colloidal stain (ISS Problue) previously described to detect protein. A faintly 
staining protein band was seen at about 92,000 to 97,000 daltons; this molecular weight determination was 
obtained by comparison on the same gel to the migration of the following marker proteins (Bethesda Research 
Laboratories): myosin, 200,000 daltons; phosphorylase B, 97,400 daltons; BSA, 68,000 daltons; ovalbumin, 

40 43,000 daltons, carbonic anhydrase 29,000 daltons; b-Iactoglobulin, 1 8,400 daltons; lysoyzme 14,300 daltons. 

EXAMPLE XIII 

CLONING OF PYROCOCCUS SPECIES DNA POLYMERASE GENE 

45 - 1 

Cross hybridization of a Pyrococcus genomic DNA library using radioactive probes prepared from the DNA 
polymerase gene of T. litoralis allowed for the identification and isolation of a DNA encoding the Pyrococcus 
DNA polymerase. This was accomplished as set forth below. 

In order to determine which restriction enzymes would be most useful in preparation of the Pyrococcus gen- 
50 omic library, Pyrococcus sp. DNA was cut to completion with Eco Rl, BamHI and Hindlll. This DNA was subject 
to agarose gel electrophoresis (Figure 13A) and Southern hybridization (Figure 13B) using a DNA probe pre- 
pared as follows, A reaction mixture containing 1 \ig of the first EcoRI fragment of the T. litoralis DNA polymerase 
gene (bp 1-1274, obtainable from bacteriophage NEB#618, ATCC No. 40794) as a template in a commercial 
random priming kit (New England Biolabs, Inc.) was incubated for 1 hour at 37°C to produce a DNA probe of 
55 high specific activity. The probe was hybridized to Pyrococcus sp. DNA prepared above under moderately strin- 
gent conditions (Hybridization: overnight at 50°C, 4X SET, 0.1M sodium phosphate, pH 7, 0.1% Na pyrophos- 
phate, 0.1 % SDS, 1X Denhardts solution; Wash Conditions: wash 3X 20-30 min. 45°C, 0.1X SET, 0.1 M sodium 
phosphate, (pH 7), 0.1% Na pyrophosphate, 0.1% SDS. Maniatis, et al. t supra). A single major band at about 
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5 Kb was detected in BamH I cut Pyrococcus DNA. EcoR I and Hind III gave multiple bands with this probe, 
indicating that these enzymes cut within the Pyrococcus polymerase gene. 

Based on the results, a BamHI genomic library was constructed using the phage vector XDASH (Strata- 
gene). Partial and complete BamHI digests of Pyrococcus DNA were prepared. A mixture of the partial and com- 

5 pletely BamHI digested DNA was ligated into the BamHI site of XDASH. The ligation mixture was packaged 
using Gigapack Gold (Stratagene) according to manufacturer's instructions and plated on E. coli ER1458. The 
packaged phage library contained 1 x 10 8 phage per ml. 

32 P-labelled DNA probes of the 3 fragments (bp 1-1274, 1656-2660 and 3069-3737) of the T. litoralis DNA 
polymerase gene (obtainable from NEB#61 9, ATCC No. 40795) were prepared using a random primer kit (New 

10 England Biolabs, Inc.). The probes were used according to the method of Benton & Davis (Maniatis, et al. supra) 
to screen the Pyrococcus genomic library using hybridization conditions described above. About one per cent 
of the plaques were positive and ten positive plaques were picked and purified by reinfection and replating 3 
times (until 90-100% of the plaques were positive for each isolate). Large amounts of phage were prepared 
from each isolate and used to infect E. coli cultures. Specifically, plate lysates (Maniatis et al., supra) of phage 

15 were prepared from each isolate and used to infect E. coli cells. 0.1 ml of each plate lysate was mixed with E. 
coli with 0.2 ml of cells (OD 6 oo=2). The bacterial cells were harvested just before lysis and suspended in 0.05 
M NaCI, 0.01 M Tris (pH 8.0), 0.1 mM EDTA, 0.1% Triton X-100 and 200 u.g/ml lysozyme (3 volumes pervolume 
of cells) and heated to 37°C for about 1 minute or until cell lysis occured. The lysed extracts were immediately 
heated at 75°C for 30 minutes, centrifuged and the supernatant solution assayed for heat stable DNA polymer- 

20 ase activity, according to the method described above. Three of the ten isolates showed significant polymerase 
activity and the clone (B9) showing the most activity was investigated further. 

The phage DNA was isolated from B9 and the insert DNA was examined by restriction enzyme digestion. 
Digestion with Sal I gave the expected two arms of XDASH plus a 15 Kb insert. Digestion with BamH I gave 
the two arms of XDASH plus three insert fragments of 7, 4.8 and 3 Kb. Each of these fragments were purified 

25 by agarose gel electrophoresis, eluted and ligated into the BamH I site of pUC19. The ligation mixture was used 
to transform E. coli ER2207 which gives white colonies when plasmids contain an insert and blue colonies with 
no inserts on indicator agar media (X-gal plus IPTG). No white transformants were obtained with the 7 Kb frag- 
ment Three whites and twenty-seven blue transformants were obtained with the 4.8 Kb fragment and twenty 
white and twenty-one blue transformants were obtained with the 3 Kb fragment All three 4.8 Kb white colony 

30 transformants expressed heat stable DNA polymerase activity. None of the transformants with the 3 Kb frag- 
ment expressed heat stable polymerase activity. The three clones carrying the 4.8 Kb Pyrococcus DNA frag- 
ment all had about the same specific activity for heat stable DNA polymerase and one was picked for further 
study (NEB#720). This clone designated NEB#720 was deposited with the American Type Culture Collection 
on October 1, 1991 and bears ATCC No. 68723. A restriction endonuclease map of the 4.8 Kb BamH I fragment 

35 containing the Pyrococcus sp. DNA polymerase gene is shown in Figure 14. A partial DNA nucleotide sequence 
coding for Pyrococcus sp. DNA polymerase (NEB720) is set forth in Figure 18, including the start of the poly- 
merase gene at bp 363 and a portion of the intervening nucleotide sequence (bp 1839-3420). NEB#720 yielded 
1700 units of DNA polymerase activity per gram of cells and was used for the large scale preparation of this 
enzyme. 

40 A portion of the Pyrococcus sp. DNA polymerase clone has been sequenced (Fig. 18, bp 1-3420). The se- 

quence of the Pyrococcus sp. DNA polymerase is very similar to the 7^ litoralis DNA polymerase at both the 
DNA and protein level (similarity calculated using the GCG Bestfit Program, Smith and Waterman, Advances 
in Applied Mathmatics, 2:482 (1981)). Overall, the genes are 66% identical, with 69% identity in the mature 
DNA polymerase amino termini regions (bp 363-1838 in Pyrococcus sp. DNA polymerase) and 63% identical 

45 - in the portion of IVS1 sequenced to date (bp 1839-3420 in Pyrococcus sp. DNA polymerase). The upstream 
regions (bp 1-362 in Pyrococcus sp. DNA polymerase, Fig. 18 and bp 1-290 in 7: litoralis DNA polymerase, 
Fig. 6) show no similarity according to the Bestfit Program. 

Similarity at the protein level is even higher. In the 1 019 amino acid Pyrococcus sp. DNA polymerase coding 
region, the two polymerases have 83% similarity and 68% identity (Fig. 1 9). When broken down into the mature 

50 polymerase amino terminus and IVS1 , the polymerase coding exons are more similar than the intervening se- 
quence, with the mature polymerase amino termini (aa 1-492 in Pyrococcus sp. DNA polymerase) being 89% 
similar, and 78% identical, and IVS1 (aa 493-1019 in Pyrococcus sp. DNA polymerase) being 78% similar and 
60% identical. 

55 
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EXAMPLE XIV 

ARCHAEBACTERIA DNA POLYMERASE SIMILARITIES AT THE DNA LEVEL 

5 The degree of cross-hybridization between the T. litoralis DNA polymerase gene and the DNA polymerase 

genes from 3 other thermophytlic archaebacteria and from Taq DNA was assessed by Southern blot hybridi- 
zation (Maniatis, supra). Chromosomal DNA from T. litoralis and Pyrococcus sp. (Strain GB-D), T. aquaticus, 
and two other Pyrococcus strains, G-1-J and G-1-H, were cleaved with either EcoRI or BamHI. 5 \xg of each 
DNA was incubated with 1x NEBuffer (EcoRI buffer for EcoRI endonuclease and BamHI buffer + 1X BSA for 

10 BamHI endonuclease) in a total volume of 60 u.! with 20 units of EcoRI endonuclease or 20 units of BamHI en- 
donuclease for 2 hours at 37°C. Four quadruplicate 0.75ng samples of each of the cleaved DNAs were loaded 
and run on a 1% agarose (SeaKem LE) gel in Tris Acetate EDTA buffer (Maniatis, supra). The gel was stained 
with Ethidium Bromide (1\ig/ml) for 20 minutes at room temperature and a photograph taken with a ruler besides 
the gel. 

75 The DNA was transferred from the gel onto nitrocellulose paper using the method developed by Southern 

(Maniatis supra). Nitrocellulose filter paper {QA5\xm) was cut to the size of the gel and soaked in 200ml of 6x 
SSC (0.9M NaCI, 0.09M Sodium Citrate) for greater than 1 hour at 37°C. Meanwhile, the gel was incubated 
for 15 minutes in 200 ml of 0.25M Hydrochloric acid at room temperature, then rinsed with distilled water. The 
gel was then incubated for 30 minutes in 200 ml 0.5M Sodium Hydroxide, 1M Sodium Chloride at room tem- 

20 perature, then rinsed with distilled water. The gel was then incubated for 30 minutes in 200-mls 1M Tris HCI, 
pH7.5, 3M Sodium Chloride at room temperature, transfer of the DNA from the gel onto the nitrocellulose was 
carried out at 4°C in 18X SSC (2.7M Sodium Chloride, 0.27M Sodium Citrate), 1M Ammonium Acetate. After 
6 hours the nitrocellulose was removed and washed in 1x SSC (0.1 5M Sodium Chloride and 0.01 5M Sodium 
Citrate) for 30 seconds. The nitrocellulose filter was air dried and then vacuum dried at 80°C for a further 2 

25 hours and then stored at room temperature. 

Four gel purified fragments of T. litoralis DNA polymerase DNA, (1.3 kb Eco Rl fragment from bp 1-1274 
representing the 5' polymerase coding region; bp 471 8-5437, representing the 3' polymerase coding region; 
bp 2448-2882, representing part of IVS1; and bp 3666-4242, representing part of IVS2, FiguresSand 15) were 
radiolabelled using the New England Biolabs Random Primer Kit. 100ng of the above template DNAs, each in 

30 a volume of 35.5 pj, were boiled for 5 minutes in a boiling water bath and then cooled on ice for 5 minutes and 
spun down. The template DNAs were incubated with 1X labelling buffer (includes random hexanucleotides), 
1/10 volume dNTP mix, 25uCi a 32 P dCTP and 5 units DNA Polymerase l-Klenow fragment in a total volume of 
50|il for 1 hour at 37°C. The reactions were stopped with 0.0 18M EDTA. The probes were purified using an 
Elutip minicolumn (Schleicher and Schuell) following the manufacturers recommended elution conditions. The 

35 total number of counts were calculated for all purified probes. The 1.3 kb Eco Rl fragment probe (bp 1-1274) 
yielded 24 x 10 6 cpm, the 3' polymerase probe (bp 4718-5436) yielded 22 x 10 6 cpm, the IVS1 probe yielded 
54 x 10 6 cpm, and the IVS2 probe yielded 47 x 10 6 cpm. 

Hybridization was carried out as follows (Maniatis supra). The nitrocellulose filter was incubated for 30 min- 
utes in 5mls prehybridization buffer (0.75M Sodium Chloride, 0.1 5M Tris, 10 mM EDTA, 0.1% Sodium Pyro- 

40 phosphate, 0.1% Sodium Lauryi Sulphate, 0.2% Bovine Serum Albumin, 0.2% Ficoll 400, 0.2% PVP and 100 
ng/ml boiled calf thymus DNA) at 50°C. Each nitrocellulose filter was then placed in separate bags with 5mls 
hybridation buffer (as above except 0.03% Bovine serum albumin, 0,03% Ficoll 400, and 0.03% PVP). Each 
section was hybridized with 22-25 x 10 6 cpm of denatured probe overnight at 50°C, 

The nitrocellulose filters were removed from the bags and incubated 3 x 30 minutes with 0.1X SET Wash 

45 ' (l5mM NaCI, 3mM Tris base, 0.2 mM EDTA, 0.1% SDS, 0.1% Sodium Pyrophosphate and 0.1M Phosphate 
Buffer) at 45°C. The filters were kept moist, wrapped in Saran Wrap and exposed to X-ray film for various times 
ranging from 4 hours to 3 days. 

The results are shown in Figure 16. In Figure 16, parts A through D are autoradiographs of quadruplicate 
Southern blots. Lanes 1-5, DNA cut with EcoRI. Lanes 6-10, DNA cut with BamHI. Lanes 1 & 6, Pyrococcus 

50 sp. G-1-J DNA; Lanes 2 and 7, Pyrococcus G-1-H DNA; Lanes 3 & 8, T. litoralis DNA; Lanes 4 and 9, Pyrococcus 
sp. GB-D DNA, Lanes 5 & 1 0, T. aquaticus DNA. The hybridization probes are as follows: part A, 5' coding region 
of T. litoralis DNA polymerase gene, bp 1-1274; part B, 3' coding region of T. litoralis DNA polymerase gene, 
bp 4718-5437; Part C, partial IVS2 probe, bp 3666-4242; Part D, partial IVS1 probe, bp 2448-2882. The upper 
and lower panels of parts C and D represent shorter and longer exposures, respectfully, of the same blots. 

55 None of the 4 probes hybridized to Taq DNA. Both polymerase coding region probes hybridize to specific 

bands in all Thermococcus and Pyrococcus DNAs, but not Taq DNA. Good signals were obtained with both 
probes indicating strong conservation of both the amino and carboxy terminal ends of the 7". litoralis DNA Poly- 
merase coding region. The amino terminal regions of T. litoralis and Pyrococcus sp. GB-D are about 69% iden- 
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tical (see, e.g. Figs. 6 and 1 8) and very similar at the protein level (Fig. 1 9). The IVS1 probe hybridized strongly 
to T. litoralis and Pyrococcus sp. GB-D DNAs (about 63% identical over a 1582 bp region) and weakly to Pyr- 
ococcus sp. G-1-H DNA. The IVS2 probe hybridized strongly to T. litoralis DNA and weakly to Pyrococcus sp. 
G-1-HDNA. 

EXAMPLE XV 

ARCHAEBACTERIA DNA POLYMERASE SIMILARITIES AT THE ANTIBODY LEVEL 

Pellets from 1 ml cultures of T. litoralis, and Pyrococcus strains were resuspended in 100uJ Urea lysis buffer 
(4M Urea, 0.12M Tris, 4% Sodium Lauryi Sulphate, 10% p-mercaptoethanol, 20% glycerol and 0.002% Bro- 
mophenol Blue) and boiled for 3 minutes. The boiled samples were sheared with 25G5/8 needle to reduce the 
viscosity of the samples. Duplicate 10jxl samples of T. litoralis, and Pyrococcus strains G-1-J and G-1-H, and 
also samples of purified Taq DNA polymerase, E. coli DNA polymerase and purified DNA polymerase from Pyr- 
ococcus sp. (GB-D) were loaded onto 10-20% SDS-PAGE gels and run in Protein Running Buffer (0.1% Sodium 
Lauryi Sulphate, 0.19M Glycine, and 0.025M Tris Base). Nitrocellulose filters (45^m) were soaked in distilled 
water for 5 minutes and then soaked in Transfer buffer (0.15% ethanolamine, 20 mM Glycine and 20% Metha- 
nol) for 30 minutes. The protein on the gels were etectroeluted (30 volts, overnight at 4°C) onto the nitrocellulose 
filters in Transfer buffer (Towbin, et al. PNAS (1 979) 76:4350-4354). 

The nitrocellulose was removed, marked with a ball point pen and washed for 5 minutes in TBSTT (20 mM 
Tris, 1 50mM Sodium Chloride, 0.2% Tween 20, and 0.05% Triton X- 1 00). The filters were blocked for 30 minutes 
in TBSTT + 3% nonfat dry milk (Carnation), and washed 3x3 minutes in TBSTT. The anti-I litoralis DNA poly- 
merase antisera was raised against a partially purified native DNA polymerase preparation. T. litoralis DNA poly- 
merase specific sera was prepared by affinity purification on Western blot strips of purified native enzyme (Beall 
et al., J. Immunological Methods86:217-2Z3 (1983)). Affinity purified anti-T. litoralis DNA polymerase mouse 
antibody (V76-2+3) and monoclonal anti-Taq polymerase antibody (diluted 1:100 in TBSTT) were added sep- 
arately to each nitrocellulose filter for 5 hours at room temperature. The filters were washed 3x3 minutes with 
TBSTT and then reacted with a 1 :7500 dilution of anti-mousse secondary antibody conjugated with alkaline 
phosphatase (Promega) in TBSTT for 1 hour at room temperature. The nitrocellulose filter was developed with 
NBT/BCIP as instructed by the manufacturers (Promega). The results using Taq monoclonal are shown in Fig- 
ure 17. Figure 17 is a Western blot of crude lysates from 7". litoralis (V), Pyrococcus sp. G-1-J (J), and Pyro- 
coccus sp. G-1-H (H), or purified polymerases from Pyrococcus sp. GB-D (DV), T. aquaticus (T) or E. coli (E) 
reacted with affinity purified anti-7: litoralis DNA Polymerase antibody in Part A or anti-Taq DNA polymerase 
monoclonal antibody in Part B. The arrow indicates the position of the 7! litoralis and Pyrococcus sp. DNA Poly- 
merase proteins. The reactivity in Part B is to background proteins and not to the DNA polymerases as seen 
in part A. 

Monoclonal antibody specific to Taq DNA polymerase does not cross-react with protein form the Pyrococ- 
cus and Thermococcus strains tested. 

However, the 90-95,000 dalton DNA polymerase proteins from T. litoralis and the 3 Pyrococcus strains re- 
acted with the affinity purified anti-I litoralis DNA polymerase antibody. This is not surprising, considering the 
high degree of both similarity and identity between T. litoralis and Pyrococcus sp. GB-D DNA polymerases (Fig. 
19). 

Figure 19 is a comparison of a portion of the deduced amino acid sequences of recombinant T. litoralis 
and the partial sequence of recombinant Pyrococcus sp. DNA polymerase. The Pyrococcus DNA polymerase 
deduced amino acid is listed on the upper line, and the deduced amino acid sequence of recombinant T. litoralis 
DNA polymerase is listed on the lower line. Identities are indicated by vertical lines, similariteies are indicated 
by 1 or 2 dots, honconserved substitutions are indicated by blank spaces between the two sequences. 

EXAMPLE XVI 

In order to obtain recombinant thermostable DNA polymerase from a target archaebacterium, several basic 
approaches to cloning the target DNA polymerase gene can be followed. Initially, one attempts to determine 
immunologically whether the new polymerase is a member of the Pol a or Pol I family by Western blot analysis 
of purified polymerase (although crude polymerase lysates may work with reduced sensitivity) using anti-Taq 
DNA polymerase or anti-7: litoralis DNA polymerase sera, as described in Example XV of this invention (Figure 
17). If the new polymerase reacts with anti-Taq Polymerase monoclonal, then it probably cannot be easily 
cloned using reagents generated from T. litoralis DNA Polymerase. If the new polymerase cross-reacts with 
anti-7: litoralis sera, then one should be able to clone it with the following procedures. If the new polymerase 
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fails to react with either sera, then the experiment is considered inconclusive and one should go onto the next 
step, DNA cross-hybridization. 

Optimum probes and DNA hybridization conditions must be experimentally determined for each new or- 
ganism. At the same time, various restriction digests of DNA from the new organism are tested in order to find 
5 enzymes which yield fragments which hybridize to the T, (itoratis probe and are large enough to encode the 
new polymerase. 

Probe selection can vary with respect to size and regions of the T. litoralis DNA Polymerase gene. Optimum 
probes can be determined by performing test Southern biots as described below with large or small DNA frag- 
ments, or even oligomers. One could select probes that are totally from within the IVS sequences to look for 

10 the presence of IVSs in new archaebacterium DNA polymerase genes, or probes could be limited to mature 
polymerase coding regions. Using the entire T. litoralis DNA Polymerase gene region as the probe has several 
advantages and disadvantages. The major disadvantage is that the larger the probe, the more likely to yield 
spurious hybridization at very low stringency. Among the advantages of using larger probes are (1) they are 
more likely to cross-hybridize to another polymerase which may have diverged greatly from the T. litoralis DNA 

15 Polymerase gene in one small portion of the polymerase, and (2) they are more likely to detect internal restric- 
tion sites in the new polymerase gene since the probe spans the amino- and carboxy-termimi of the 7". litoralis 
DNA Polymerase gene. It is important at the initial stages of probing to use several restriction enzymes to cleave 
the DNA from the new archaebacterium to find one or more enzymes which yield preferably one, or possibly 2 
bands, which hybridize to the 7". iitoralis DNA Polymerase probe and which are large enough to encode the 

20 new polymerase. The minimum coding sequence required for the new polymerase can be estimated from the 
size of the new polymerase determined by Western blots (assuming a factor for IVSs, if desired) or, by guessing 
at greater than 4 KB as a first approximation. Maximum fragment size is limited by the cloning capacity of the 
desired vector. 

Optimum hybridization conditions are experimentally determined by performing test Southern blots at va- 

25 rious wash temperatures. Hybridization is carried out at 50°C in 4X SET, 0.1 M sodium phosphate, pH 7, 0.1% 
Na pyrophosphate, 0.1% SDS, 1X Denhardts solution, although any low stringency hybridization condition 
would also be suitable (Maniatis). Wash conditions are varied from 37-55°C, 3 x 30 minutes with 0.1X SET 
wash (15mM NaCI, 3mM Tris base, 0.2mM EDTA, 0.1% SDS, 0.1% Sodium Pyrophosphate and 0.1M Phos- 
phate Buffer), although any standard low stringency wash conditions can also be used. The point of this part 

30 of the experiment is to hybridize the probe and wash the Southern blot at low stringency to insure some level 
of cross- hybridization which may even include non-specific cross-hybridization. Next, one increases the wash 
stringency, for example, increasing the wash temperature in 3-5°C increments and then monitoring the disap- 
pearance of hybridized probe as determined by a decrease in signal upon autoradiography. Initially, one ex- 
pects to see many bands hybridizing to the probe at low stringency. As the wash stringency increases, weakly 

35 hybridizing sequences melt off and disappear from the autoradiograph. As wash stringency is increased, con- 
ditions are established at which only one or a few bands still hybridize to the probe. These are the conditions 
to be used in future experiments. As stringency increases beyond this point, all hybridization signal is lost. The 
goal is to determine the most stringent condition where one or a few bands per digest still hybridize to the probe 
before all hybridization signal is lost. 

to If initial probing with a large T. litoralis DNA polymerase gene fragment fails to give a clear pattern using 

any hybridization conditions, then smaller probes can be tested until a good partnership of probe size and hy- 
bridization conditions are established. Alternatively, Example XIV of the present invention shows that several 
fragments spanning different regions of the T. litoralis DNA polymerase gene (amino terminus, IVS1 , IVS2 and 
carboxy terminus, Figures 1 5 and 1 6)) can be used in separate Southern blots, but tested in parallel at the same 

45 - time. 

Libraries are constructed with the optimum restriction digests and hybridized with the optimized probe. A 
parallel approach is to clone in expression vectors and directly screen with anti-T. litoralis sera. Either primary 
approach may yield active or inactive product. If no active polymerase is detected, the clone is checked for 
insert size and reactivity to anti-7. litoralis sera. If there is no reactivity to anti-I litoralis sera, then the poly- 

50 merase may not be expressed from its own control sequences in E. coli and the pi asm id insert must be se- 
quenced to operably link the new polymerase to an £ coli promoter and perhaps translation signals. 

In the present invention, we have identified introns or intervening sequences in Pol a conserved region 
motifs in both T. litoralis and Pyrococcus sp. DNA polymerase genes. We therefore predict that other Archae 
DNA polymerase genes may have introns in conserved motifs also. If the new polymerase clone is inactive, it 

55 should be checked for the presence of intervening sequences. These introns can be identified in 2 ways. If these 
introns are related to introns found in T. litoralis and Pyrococcus sp. DNA polymerase genes, they can be iden- 
tified by low stringency hybridization to DNA probes derived from intron sequences of T. litoralis and Pyrococcus 
sp. DNA polymerase genes. If IVSs are found, the clone is sequenced to develop strategies for removal of the 



34 



EP 0 547 920 A2 



IVS. If the clone is inactive and no cross-hybridizing IVSs are found, then the plasmid is sequenced to look for 
new IVSs. The archaebacterium DNA polymerase gene can be sequenced at the DNA level and the sequenced 
compared to (1) other DNA polymerases to identify non-similar segments (2) conserved motifs to look for the 
absence of Regions l-VI, followed by identification of interruption points in Regions which are absent. Once 

5 identified, introns can be removed in vitro by any number of techniques known in the art, some of which are 
described in this application with respect to to removal of IVS1 and IVS2 from the I litoralis DNA polymerase 
gene. 1 

If the primary library screening fails to produce a clone synthesizing active thermostable DNA polymerase, 
but does result in a partial gene clone as determined by (1) cross- hybridization at the DNA level, (2) cross- 
to reactivity at the antibody level, and (3) similarity to other DNA polymerases at the DNA sequence or deduced 
amino acid sequence levels, then more genomic Southern blots are probed with the initial clone to identify re- 
striction enzymes to be selected for making the next library. The second library should contain larger fragments 
which are more likely to encode the entire polymerase gene. The library is screened with either antibody or 
preferably, the initial new polymerase cloned sequence. The resultant positives are checked for thermostable 

15 DNA polymerase activity. If no active thermostable DNA polymerase is detected in this second round, then in- 
tervening sequences can be screened for by cross-hybridization and DNA sequencing. DNA sequencing can 
also indicate whether the cloned gene is complete by establishing the presence of all theconserved polymerase 
motifs and a stop codon in the polymerase open reading frame. Several rounds of screening and rescreening 
may be necessary before finally cloning an active thermostable DNA polymerase. 

20 It should also be noted that the above screening and rescreening procedure may not be sufficient for cloning 

the new thermostable polymerase gene because of toxic elements present in the gene. In this case, cross- 
reactivity at the DNA or protein level is an excellent method of cloning because only partial, inactive products 
can initially be cloned which will allow subsequent cloning of the complete gene. If obtaining the complete gene 
is not straightforward using the strategy outlined above, one should look for the presence of intervening se- 

25 quences like IVS2 which are very toxic when cloned. This is accomplished by either looking for deletions and 
rearrangements in polymerase clones or by v probing for known toxic T. litoralis IVS sequences. Duplicate South- 
ern blots are probed with polymerase coding regions and IVS sequences to locate toxic IVSs in proximity to 
the polymerase coding region. If rearrangements or toxic IVSs are found, then the appropriate strategy would 
be to first operably link the amino terminal of the polymerase to a very tightly controlled expression system as 

30 described in this present application. Once accomplished, the remainder of the polymerase gene can be cloned 
and ligated to the amino terminus, reducing expression of toxic elements such as the T. litoralis IVS2 sequence. 
Alternatively, cross-hybridizing sub-fragments of the polymerase gene can be isolated, checked for IVSs by 
hybridization or DNA sequencing, IVSs can be removed in vitro from these regions by methods known in the 
art. The complete polymerase gene can then be constructed by ligation of sub-fragments from which toxic ele- 

35 ments have been removed. 



Claims 

40 1. Recombinant thermostable DNA polymerase from archaebacteria which is encoded by a DNA sequence 
which hybridizes to a nucleotide sequence selected from the group consisting of the nucleotide sequence 
of Figure 6 or a portion thereof, nucleotides 1 to 1274 of Figure 6, nucleotides 1269 to 2856 of Figures 6 
and nucleotides 2851 to 4771 of Figure 6. 

45 ' 2. The recombinant thermostable polymerase of claim 1 , wherein the portion of the nucleotide sequence of 
Figure 6 is at least about 20 nucleotides in length. 

3. The recombinant thermostable polymerase of claim 1 , wherein the portion of the nucleotide sequence of . 
Figure 6 is at least about 50 nucleotides in length. 

50 

4. The recombinant thermostable polymerase of claim 1, wherein the portion of the nucleotide sequence of 
Figure 6 is at least about 150 nucleotides in length. 

5. Recombinant thermostable DNA polymerase from archaebacteria which hybridizes to an antibody probe 
^ which has antigenic specificity to 7^ litoralis DNA polymerase. 

6. Isolated DNA which codes for the recombinant thermostable DNA polymerase of claim 1. 

7. A cloning vector comprising the isolated DNA of claim 6. 
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8. A host cell transformed by the vector of claim 7. 

9. A method for producing a recombinant thermosta ble DNA polmerase from archaebacteria comprising cui- 
turing a host cell transformed with the vector of claim 7 under conditions suitable for the expression of the 
DNA polymerase. 

10. A DNA probe which hybridizes to the DNA sequence coding for the archaebacteria thermostable DNA poly- 
merase of claim 1 .wherein the DNA probe is selected from the group consisting of the nucleotide sequence 
of Figure 6 or a portion thereof, nucleotides 1 to 1274 of Figure 6, nucleotides 1269 to 2856 of Figure 6 
and nucleotides 2851 to 4771. 

11. The DNA probe of claim 10, wherein the portion of the nucleotide sequence of Figure 6 is at least about 
20 nucleotides in length. 

12. The DNA probe of claim 10, wherein the portion of the nucleotide sequence of Figure 6 is at least about 
50 nucleotides in length. 

13. The DNA probe of claim 10, wherein the portion of the nucleotide sequence of Figure 6 is at least about 
1 50 nucleotides in length. 

14. A method for isolating DNA coding for thermostable DNA polymerase from archaebacterium comprising 
the steps of: 

(a) forming a genomic library from the archaebacterium; 

(b) transforming or transfecting an appropriate host cell with the library of step (a); 

(c) contacting DNA from the transformed or transfected host cell with a DNA probe selected from the 
group consisting of the nucleotide sequence of Figure 6 or a portion thereof, nucleotides 1 to 1274 of 
Figure 6, nucleotides 1269 to 2856 of Figure 6 and nucleotides 2851 to 4771; 

(d) assaying the transformed or transfected cell of step (c) which hybridizes to the DNA probe for DNA 
polymerase activity; and 

(e) isolating a DNA fragment which codes for the thermostable DNA polymerase. 

15. A method for isolating DNA coding for thermostable DNA polymerase from archaebacterium comprising 
the steps of: 

(a) forming a genomic library from the archaebacterium; 

(b) transforming or transfecting an appropriate host cell with the library of step (a); 

(c) contacting extract from the transformed or transfected host cell with an antibody probe which has 
specific affinity for T. litoralis DNA polymerase; 

(d) assaying the transformed or transfected cell of step (c) which is cross-reactive to the antibody probe 
for DNA polymerase activity; and 

(e) isolating a DNA fragment which codes for the thermostable DNA polymerase. 

16. A method for increasing the expression of a thermostable DNA polymerase from archaebacteria compris- 
ing the steps of: 

(a) identifying and locating any intervening nucleotide sequence in the isolated DNA of claim 14 or 1 5; 
and 

(b) removing the intervening nucleotide sequence from the isolated DNA. 

17. The method of claim 16, wherein the archaebacteria comprises T. litoralis. 

18. The method of claim 17, wherein the intervening nucleotide sequence is selected from the group of IVS1, 
IVS2orlVS1 and!VS2. 

19. The method of claim 16, wherein the intervening nucleotide sequence is identified and located with a DNA 
probe coding for an intron from the DNA sequence encoding T. litoralis DNA polymerase. 

20. The method of claim 19, wherein the DNA probe is selected from the group consisting of a 1614 bp nu- 
cleotide sequence of Figure 6 comprising nucleotides 1776 to 3389 or a portion therof and an 1170 bp 
nucleotide sequence of Figure 6 comprising nucleotides 3544 to 4703 or a portion thereof. 

21. A thermostable endonuclease obtainable from T. litoralis which cleaves double-stranded deoxynucleotide 
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acid pBR322 at position 1 64 and 241 1 . 
22. The endonuciease of claim 21 , having a molecular weight of about 33,000-37,000. 
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Fig. 1A. SDS-Polyacrylamide Gel of Purified 
T. iitoraiis DNA Polymerase 
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Lane 1: Molecular weight markers 

Lane 2: Purified T. Iitoraiis DNA Polymerase 
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MOLECULAR WEIGHT MARKERS 



SIZE DETERMINATION OF T.IHoralis ON A POLYMERASE 

FUNCTIONS 
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mariCGCGA tmaaictat tticitcctc cnarrrrcAA ottcaaaaac gtaagcatca 60 

GOCAAACCTC TOGQOTITC TCTCICCTIC OOGCEAACCC TCITCAAAAC TCICTCCAAA 120 

GCAnTTTTG ATGAAAGCTC M3CTCCTCT ATCAGGGICA GTATATCTGC AATGAGTTCG ' ISO 

TGAAGGGTTA riCTCTAGAA CAACTCCAIG ATTTTOGATT TGGATGGGGG TTH^AAAATT 240 

TGGCGGAACT TTTATITAAT TTGAACTCCA GTTTAIATCT GGTCCTATTT ATGATACTGG 300 

ACACram* CATOACAAAA GATGGCAAGC CTAIAATCCG AATTTTIMG AAAGAGAACG 360 

GGGMTITAA AAXAGAACIT GMCCTCATT TICAGOCCTA TATATATCCT CITCTC\AAG 420 

ATCSCTCOGC TAOTCTGGAG AIMAGGCAA TAAAGGGCGA GAGACATCGA AAAACTGTGA 480 

GACTCCTOGA TGCACTSAAA GTCAGGAAAA AAITTTTGGG AAGGGAAGTT GAAGTCTGGA 540 

AGCTCATTIT OOTGCATOCC CAAGACGTTC CAGCIATGCG GGGCAAAATA AGGGAACVIC 600 

CW3CTGTGGT TGACATTIAC GAAIATCACA TTOCCTTTGC CAAGCGTTAT CICAXAGVCA 660 

AGGGCITGAX TCXXATGG7-J3 GGAGMGAGG AGCTTAAGCT CCTTGCCTTT GATATTGAAA 720 

CCTITIATCA TCAGGGAGAT GAAXTTGGAA AGGGOG^GAT AAXAATGATT AGTTATC-CCG 780 

ATCAAGAAGA GGCCAGACTA ATCSCATGGA AAAATATCGA TTIX300GTAT GTCGATGTTG 840 

TCTCCCAATGA AAGAGAAATC AXAAftGCGTT TTCTTCAAGr TCTTAAAGAA AAAGACCCCG 900 

ATCTCATAAT AACTTACAAT GGGGACAATI TTGAXITGCC GTATCTCATA AAACGGGCAG . 960 

AAAAGCTGGG ACTTQGGCIT GTCTT7^GGAA GGGACAAAGA ACATCCCGAA OOC^GVTTC 1020 

AGAGGATGGG TGAXRGTTTT GCICTGGAAA TCAAGGGIAG AATCCACTIT GATCTTTTCC 1080 

FIG. 6-1 
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AAOCTCCCAA CCTTCVCGCT TGAGGCAGTT T7\TCAAGCAG 
TTITAGGAAA AMCAAAAGC AAATIAGGAG CAGAGGAAAT TGCCGCT3VTA TGGGAAACAG 
AAGAAAGQ\X GAAAAAACTA GOOT^CT CAATGGAAGA TtSCTAGGGCA ACGTATGAGC 
TOGGGAAGGA AITCITOCOC AXOGRMCTO AGCTGGCAAA GCTGAT7VGGT CAAAGTGXAT 
GGGAOCTC7TC GAGAICAAGC ACCGGCAACC TCGTGGAGTG GXATCTTTTA AGGGTGGCAT 
ACGOGAGGAA TGAPCTTGCA- CCGAACAAAC CTGAXGAGGA AGAGTATAAA CGGCGCTTAA 
GAACAACTm CCTGGGAGGA TATGTAAAAG AGCCAGAAAA AGGTTTGTGG GAAAATATCA 
TTTATITGGA riTOCGCACT CTGTACCCTT CAAXAATAGT TACICACAAC GTATCCCCAG 1560 
AmCCCTTGA AAAAGAGGGC TGIAAGAATT AOGATC7ITCC TCCGA17\GI* GGATTvTAGGT 1620 
TCTGCAAGGA CTTTOOGGGC TTTATTOCCT CCATACTCGG GGACTTAAXT GCAATGAGGC 1630 
.AAGATASAAA GAAGAAAATG AAATCCACAA TTGACCCGAT CGAAAAGAAA ATGCTCGATT 1740 
AIAGGCAAAG CX3CTATI7VAA TIOTTGCAA ACAGCATCTT ACCCAACGAG TCCTI7.CCAA 1800 
TAATTGAAAA TCGAGAAATA AAAXTCGTGA AAAITGGCGA GTTTAXAAAC TCITACATGG I860 
AAAAACAGAA GGAAAACGTT AAAACACTAG AGAATACTGA AGITCICGAA GTAAAO\ACC 1920 
TTTITGCATr CICATTCAAC AAAAAAATCA AAGAAAGTGA AGTCAAAAAA GTCAAAGCCC 1980 
TCATAAGACA TAAGTATAAA GGGAAAGCTT ATGAGATTCA GCITAGCICT GGTAGAAAAA 2040 
TTAACA1AAC TCCTGGCCAT AGICTGTTTA CAGTCAGAAA IGGPjSAAATA AAGGA^GTIT 2100 
CTCGAGATGG GATAAAAGAA GGTGACCTTA TTGXAGCACC AAAGAAAATT AAACTCTVATG 2160 
AAAAAGGGGT AAGCATAAAC ATTOCOGAGT TAATCTCAGA TCTTTCCGAG GAAGAAACAG 2220 
CCGACA1TGT GATGACGAXT TCAGCCAAGG GCAGAAAGAA CTTCTTTAAA GGAATGCTGA ~ 2280 



F IG. 6 - 2 
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GAACITTAAG CTQGATCITr GGAGAAGAAA AIAGAAGGAX AAGAACAITT AATCGCTRTT 2340 

TCJTTCCATCT CGAAAAACTA GGCCTTATCA AACTACTGCC COGOGGAT7VT GAAGirACTG 2400 

ACTGGGajSAG ATTAAAGAAA TMAAACAAC TTTACGAGAA GCTTGCTGGA AGCGITAAGT 2460 

ACAACGGAAA CAAGAGAGM TAITIAGTAA TCTTCAACGA GATCAAGGAT TTIMTiTCTT 2520 

ACITOCCACA AAAftGAGCTC GAAGAATGGA AAA2TGGAAC TCTCAATGGC TTTAGAACGA 2580 

AXTGTAITCT CAAAGTCGAT GAGGATTTTG GGAAGCTCCT AGGTTACTAT GTTAGTGAGG 2640 

GCTATGCAGG TGCACAAAAA AATAAAACTG GTGGTATOG TTA1TOGGTG AAGCTTTACA 2700 

ATGAGGACOC TAATGTTCTT G^GAGCATGA AAAATGTTGC AGAAAAATTC TTTGGCAAGG 2760 

TTAGAGTTGA CASAAATTGC GTAAGTATAT CAAAGAAGAT GGCATACTTA GTTATGAAAT 2820 

GCCTCTGTGG ACCAXIAGOC GAAAACAAGA GAAXPCCTTC TGTTATACTC ACCTCTCCOG 2880 

AACCGGTACG GTGGTCAITT TTAGAGGCGT ATTTM3JGG CGATGGAGAT ATACATCCAT 2940 

CAAAAAGGTT TPGGCICTCA ACAAAAAGCG MCTOTITGC AAA0TAGCTT GTGTTCTTGC 3000 

TGAACTCTTT GGGAAIAXCC TCIGTAAAGA TMGCITTGA CNGTGGGGTC TATAGAGTGT 3060 

ATAXAAATGA AGACCTGCAA 1TICCACAAA CGTCTAGGGA GAAAAACACA TACIT^CTCTA 3120 

ACITAATTCC CAAAGAGATC CITAGGGACG TGTTIGGAAA AGAGTTCCAA AAGAACATGA 3180 

CGTTCAAGAA ATTTAAAGAG CITGTTGACT CTGGAAAACT TAACAGGGAG AAAGCCAAGC 3240 

TCTTGGAGIT C7TTCAITAAT GGAGATAXTG TCCTTGACAG AGTCAAAAGT GTTAAAGAAA ' 3300 

AGGACIATGA AGGCTATCTC TATGACCTAA GCGTIGAGGA TAACGAGAAC TTTCTTGTTG 3360 

GTITTTGCTTT GCTCIATGCT CACAACAGCT ATXACGGCTA TATGGGGTAT CCTAAGGCAA 3420 

GATGGTACTC GAAGGAATCT GCTGAAAGOG TTAOCGCATG GGGGAGACAC TACATAGAGA 3480 

FIG. 6-3 
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"^^ ^^ GAAAAGTTOG GdTIAAGGT TCTTTATGCG GAQ\CTGTCT 
CAGGAGAAAG TC^GATCAIA ATAAGGCAAA MGGAAAGAX TAGAnTCTC AAAAIAAAGG 

McriTrcrc tmggiggac ^qgcaitg gogaaaaaga at^catt ctcgaagcttc 

TTGAAGC^ ASCICIGGAC GATCACGGAA AGCITCICIG GAAGCCCGTC CCCITVC^A 
T^AGGCACAG MOGAASMA AGAAXGTTOC GCAICITOCT GAOCAACMC TGGTAT?t'I7jG 
ATGTTACTGA GGAIQITCT CICA^GGCT ATCTAAACAC GTCAAAAACG AAAACTGCCA 
AAAAAATCGG GGAA^IA AAGGAAGTAA AGOCITTTGA ATTAGGCAAA GCAGTAAAAT 
CX3CTCATATG CCCAAATCCA CCGTE\AAGG AXGAGAAIM C^AAACIMC GAAATAGCAG 
TAAAAXTCTG GGAGCKCT^ GGATTGATTG TAGGAGATGG AAACTGGGGT GCSVGATTCTC 
CTTCGGCAGA CraTEATClT GGACITIX^A CAGGCAAAGA TGCAGAAGAG ATAAAGCAAA 
AaCTTCTQGA ACdoCTAAAA ACTTAXGCSG TAATCTCAAA CEATTAOCGA AAAAACGAGA 
AMGGCTCTT CASCATCTTG GCAAAGAGCC TTGTAAAGTT TATGAAAAGG CaCTTA*GG 
ACGAAAAAGG AAGAOGAAAA ATIXXAGAGTT TCATGTATCA GCZTOOGGTT ACITACAT7G 
AGGCATCTCT AOGAGG3CTG TITO^GCTG ATQGTACTGT AACIMC3GG AAGGGAGITC 
CAGAGATOG GCTAACAAAC ATTGATGCTG ACTTICEAAG GGAAGTAAGG A,VmC^T 
GGAITGTITGG AATITCAAAT TCAAXATTTG CIGAGACTAC TCCAAATCGC T7CAATGGTG 
TITCI^CTGG AACCTCTCA AAGCATCTAA GGATCAAAAA TAAGTGGOGT. TTTGCTGT^AA 4500 
GGATAGGCTT TTIAATCGAG AGAAAGCAGA AGAGACTCTT AGAACATTTA AAATC\GCCA 4 560 
GGCTAAAAAG GAAIAOCATA GATnTGGCT TTGATCTTGT GCATGTGAAA AAAGTCGAAG 4620 
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MATRCCAXA CCTGGGTrAC GITTATCACA TIGAAGICGA AGAGAOGCAT AGGTICTTTG 4630 

CAAACAACAX CCTGGXACAC AAXACTGACG GCITITATGC CACAATMCC GGGGAAAAGC 4740 

CTGAACTCAX TMAAAGAAA GCCAAGGAAT TCCTAAACTA OUMACICC AAACITCCAjG 4800 

gictccttga cxtitsgtat GAGGGcrrrr actpgagagg attctitctt acaaaaaagc 48 6o 

GCIMGCACT CATAGATGAA GAGGGCAGGA TAACAACAAG GGGCTTGGAA GITCIMXSGA 4920 

GAGATTGGAG TGAGATAGCT AAGGAGACTC AGGCAAAGGT TTTAGAGGCT AIT^CTTAAAG 4930 

AGGGAAGTGT TGAAAAAGCT GTAGAAGTTG TTAGMATCT TGTAGAGAAA ATRGCAAAAT 5040 

ACAGGGTTOC ACTTGAAAAG CTTGTTATCC ATGAGO^GAT TACCAGGGAT TTAAAGGACT 5100 

ACAAAGCCAT TGGCCCTGAT CTCGCGATAG CAAAAAGACT TGOOGCAAGA GGGATAAAAG 5160 

TCAAMCGGG CACAATAATA AGCTATATCG TTCTCAAAGG GAGCGGAAAG ATOVGCGATA 5220 

GGGTAATITT ACTTACAGAA TACGATCCIA GAAAACACAA CTACGAICCG GACTACTACA 5280 

TAGAAAACCA AGTTCTGCCG GCACTACTTA GGAXACTCGA AGCGTTTGGA TACAGAAAjGG . 5340 

AGGATTTAAG GXATCAAAGC TCAAAACAAA COGGCITPjGA TGCATGGCTC AAGT^GGT^jGC 5400 

TCTGTITGCIT TTTAGIOCAA GTTTCTCCGC GAGTCTCTCT ATCTCTCTTT TCTATICICC 5460 

TATCTGGTTr TCATTCACTA TTAAGTMTC CGOCAAAjGOC ATAACGCTTC CAATTCCAAA 5520 

CTTGAGCTCT TTOCACTCTC TGGCCTCAAA TTCACTCCAT GTTTITGGAT OGTTCGCTTCT 5530 

CCCTCTICTG CTAAjGCCTCT OGAA^TCTTTT TCTTGGCGAA GAGTGTACAG CTATGATGAT 5640 

TATCTCTTCC TCTGGAAACG CATCTTCAAA CGTCTGAATT TCATCTAGAG ACCTCACTCC 5700 

GTCGAITATA ACTGCCTTGT ACTTCTTEAG TAGITCTTIT ACCITTGGGA TCGTTAATTT 57 60 

TGOCAOGGCA TTGTOOCCAA GCICCTGCCT AAGCTGAATG CTCACACTGT TCATACCTTC 5520 

GGGAGITCTT GGGATCC 5837 

FIG. 6-5 
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FIG.9 
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Pyrococcus sp. DNA Polymerase Gene Region 
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GGATCCCTCTCTTTTTGGTAACCCCATACGTCATTCCCTCAACCAAA-ACT 

TCAGCATCGlTGCAGTGGTCAGTGTGTCTGTGGGAGATGAAGAGGACGTC 

GATTTTTCTGGGGTCTATCTTGTATCTCCACATTCTAACTAACGCTCCAG 

GCCCAGGATCAACGTAGATGTTTTTGCTCGCCTTAATGAAGAAGCCACCA 

GTGGCTCTTGCCTGCGTTATCGTGACGAACCTTCCACCACCGCCACCGAG 

AAAAGTTATCTCTATCATCTCACACCTCCCCCATAACATCACCTGCTCAA 

TTTTTAAGCGTTCTTAAAGGCTTAAATACGTGAATTTAGCGTAAATTATT 

GAGGGATTAAGTATGATACTTGACGCTGACTACATCACCGAGGATGGGAA 

GCCGATTATAAGGATTTTCAAGAAAGAAAACGGCGAGTTTAAGGTTGAGT 

ACGACAGAAACTTTAGACCTTACATTTACGCTCTCCTCAAAGATGACTCG 

CAGATTGATGAGGTTAGGAAGATAACCGCCGAGAGGCATGGGAAGATAGT 

GAGAATTATAGATGCCGAAAAGGTAAGGAAGAAGTTCCTGGGGAGGCCGA 

TTGAGGTATGGAGGCTGTACTTTGAACACCCTCAGGACGTTCCCGCAATA 

AGGGATAAGATAAGAGAGCATTCCGCAGTTATTGACATCTTTGAGTACGA 

CATTCCGTTCGCGAAGAGGTACCTAATAGACAAAGGCCTAATTCCAATGG 

AAGGCGATGAAGAGCTCAAGTTGCTCGCATTTGACATAGAAACCCTCTAT 

CACGAAGGGGAGGAGTTCGCGAAGGGGCCCATTATAATGATAAGCTATGC 

TGATGAGGAAGAAGCCAAAGTCATAACGTGGAAAAAGATCGATCTCCCGT 

ACGTCGAGGTAGTTTCCAGCGAGAGGGAGA7GATAAAGCGGTTCCTCAAG 

GTGATAAGGGAGAAAGATCCCGATGTTATAATTACCTACAACGGCGATTC 

TTTCGACCTTCCCTATCTAGTTAAGAGGGCCGAAAAGCTCGGGATAAAGC 

TACCCCTGGGAAGGGACGGTAGTGAGCCAAAGATGCAGAGGCTTGGGGAT 

ATGACAGCGGTGGAGATAAAGGGAAGGATACACTTTGACCTCTACCACGT 

GATTAGGAGAACGATAAACCTCCCAACATACACCCTCGAGGCAGTTTATG 

AGGCAATCT7CGGAAAGCCAAAGGAGAAAGT7TACGCTCACGAGATAGCT 

GAGGCCTGGGAGACTGGAAAGGGACTGGAGAGAGTTGCAAAGTATTCAAT 

GG AGG AT GC AAAGGT AACGT ACG AGC TC G G7AGGGAGTTCT.TCCC AAT GG 

AGGCCCAGCTTTCAAGGTTAGTCGGCCAGCCCCTGTGGGATGTTTCTAGG 

TCTTCAACTGGCAACTTGGTGGAGTGGTACCTCC7CA.GGAAGGCCTACGA 

G AG G AAT G A AT T G G C 7C C AAAC AAG C C GG AT G AG AG GG AG T AC G AG A G A A 

G G C T AAG GG AG AG CTACGCTGGGGGATACGT 7 AAGG AG C C G G AG AAA G G G 

CTCTGGGAGGGG77AGTTTCCCTAGATTTCAGGAGCCTG7ACCCCTCGAT 

AAT AATC AC C C AT AAC GTCT C ACC GG AT AC G C 7G AAC AGGG AAGGGTG T A 

GG G AAT ACGATGTCGCC C C AG AG G T T G G GC AC AAG T T C TG C AAG G AC T T C 

CCGGGG777 AT C C C C AGCCT G C7 C AAG AGG7 7 AT T G G AT G AAAGG C AAG A 

AAT AAAAAG G AAG AT G AAAG C 77 CT AAAG AC C C AAT C G AG AAG AAG AT G C 

TT G AT T AC A G G C AAC G GG C AATC AAAAT C C 7 G G C AAAC AGC AT T 7 T AC C G 

G AA G AA 7 G G G 7 7 C C A C T AAT 7 AAAAA C G G 7 AAA.G T 7 AA. G AT A.T T C C G C A. T 

TGGGGACTTCGTTGATGGACTTATGAAGGCGAACCAAGGAAAAGTGAAGA 

AAACGGGGGATACAGAAGTTTTAGAAGTTGCAGGAATTCATGCGTTTTCC 

TTTGACA.GGAAGTCC AAGAAGGCCCGTGTAATGGCAGTGAAAGCCGTGAT 

A.AG AC AC C G T T ATT C C GG AAAT G T TT AT A.G AA T AGTC T T AAAC T C T G G T A. 

G AAAAA7 AAC AAT AAC AG AAG G G C AT AGCCTA.TTTGTCT AT AG G A-AC G G G 

GATCTCG7TGAGGCAACTGGGGAGGATGTCAJ*AATTGGGGATCTTCTTGC 

AGTTCCAAGATCAGTAAACCTACCAGAGAAAAGGGAACGCTTGAATATTG 

TTGAACTTCTTCTGAATCTCTCACCGGAAGAGACAGAAGATATAATACTT 

ACGATTCCAGTTAAAGGCAGAAAGAACTTCTTCAAGGGAATGTTGAGAAC 

ATTACGTTGGATTTTTGGTGAGGAAAAGAGAGTAAGGACAGCGAGCCGCT 

ATCTAAGACACCTTGAAAATCTCGGATACATAAGGTTGAGGAAAATTGGA 

TACGACATCATTGATAAGGAGGGGCTTGAGAAATATAGAACGTTGTACGA 

GAAACT7GTTGATGTTGTCCGCTATAATGGCAACAAGAGAGAGTATTTAG 
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TTGAATTTAATGCTGTCCGGGACGTTATCTCACTAATGCCAGAGGAAGAA 

CTGAAGGAATGGCGTATTGGAACTAGAAATGGATTCAGAATGGGTACGTT 

CGTAGATATTGATGAAGATTTTGCCAAGCTTCTTGGCTACTATGTGAGCG 

AGGGAAGTGCGAGGAAGTGGAAGAATCAAACTGGAGGTTGGAGTTACACT 

GTGAGATTGTACAACGAGAACGATGAAGTTCTTGACGACATGGAACACTT 

AGCCAAGAAGTTTTTTGGGAAAGTCAAACGTGGAAAGAACTATGTTGAGA' 

TACCAAAGAAAATGGCTTATATCATCTTTGAGAGCCTTTGTGGGACTTTG 

GC AG AAAACAAAAGGGTTCCTGAGGTAATCTTT AC CTCATCAAAGGGCGT 

TAGATGGGCCTTCCTTGAGGGTTATTTCATCGGCGATGGCGATGTTCACC 

CAAGCAAGAGGGTTCGCCTATCAACGAAGAGCGAGCTTTTAGTAAATGGC 

CTTGTTCTCCTACTTAACTCCCTTGGAGTATCTGCCATTAAGCTTGGATA 

CGATAGCGGAGTCTACAGGGTTTATGTAAACGAGGAACTTAAGTTTACGG 

AATACAGAAAGAAAAAGAATGTATATCACTCTCACATTGTTCCAAAGGAT 

AT T C T C AAAG AAACT T T TGGT AAG G T CTTC C AG AAAAAT AT AAG TT AC AA 

GAAATTTAGAGAGCTTGTAGAAAATGGAAAACTTGACAGGGAGAAAGCCA 

AAC GCATT GAGTGGT TACTTAAC GG AG AT AT AGTCCT AG AT AG AGTCGT A 

GAG .AT T AAG AG AG AG T AC TAT G AT GG TT AC G T 7 T AC G AT C T AAG T G T C G A 

T G AAG AT G AG AAT T T C C T TG 
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BESTFIT comparison of the deduced amino acid sequences of 
Pyrococcus DNA Polymerase (top line] to T.litoralis DNA Polymerase (bottom line) 

BESTFIT of: Pyrococcus .Pep from: 1 to: 1019 

TRANSLATE DNA of: Pyrococcus . seq from: 363 to: 3420 
to: T.litoralis. Pep from: 1 to: 1100 

TRANSLATE DNA of: T . litoralis . seq from: 291 to: 5401 

Percent Similarity: S3. 219 Percent Identity: 68.302 



1 MILDADYITEDGKPIIRIFKKENGEFKVEYDRNFRPYIYALLKDDSQIDE 50 

1 1 H ■ I I I' • I I I I H I I I II I I ( I I | : | . | . : | . | | t | I | | | | | | . | : | 
1 MILDTDYITKDGKPIIRIFKKENGEFKIELDPHFQPYIYALLKDDSAIEE 50 

51 VRKITAERHGKIVRIIDAEKVRKKFLGRPIEVWRLYFEHPQDVPAIRDKI 100 

: : I • : I I I I I • I I : : I I I I I I I I I I I . : I t I : I . ! I M I I t f I : I : | | 
51 IKAIKGERHGKTWVLDAVKVRKKFLGREVEWKLIFEHPQDVPAWRGKI 100 

101 REHS AVIDIFEVDIPFAKRYLIDKGLIPMEGDEELKLLAFDIETLYHEGE 150 

1 1 f • I ' : I I : I I I I I I I I ! | t I I I I I I I I j | I | | | | | l | | | | | | : | t | | : 
101 REHPAWDIYEYDIPFAKRYLIDKGLIPMEGDEELKLLAFDIETFYHEGD 150 

151 EFAKGP IIMIS YADEEEAKVITWKKIDLPYVEWSSEREMIKRFLKVIRE 20 0 

1 1 : I ' • I I I I I I I I I I I I : M I II ■ I I II I I : I I I . I t I I II II : . I : : 1 
151 EFGKGEIIMISYADEEEARVITWKNIDLPYVDWSNEREMIKRFVQWKE 200 

201 KDPDVI ITYNGDSFDLPYLVKRAEKLGIKLPLGRDG . . SEPKMQRLGDMT 248 

"! II I I M II I .MIIICIIIM ||::[.M|| .|!|:||:|| 
201 KDPDVI IT YNGDNFDLPYLIKRAEKLGVRLVLGRDKEHPEPKIQRMGDSE" 250 

249 AVEIKGRIHFDLYHVIRRTINLPTYTLEAVYEAIFGKPKEKVYAHEIAEA 298 

1 I I I I I I I I II I :. I : I I I I I | I | | | | | | | ( | | :: t |. | .| : J . | | | . 
251 AVEIKGRIHFDLFPWRRTINLPTYTLEAVYEAVLGKTKSKLGAEEIAAI 300 

299 WETGKGLERVAKYSMEDAKVTYELGRSFFPMEAQLSRLVGQPLWDVSRSS 348 

I I I I . I I I I I I :. M II I : I I I I I I | : | | : | | | | | | | | | 

301 WETEESMKKLAQYSMEDARATYELGXEFFPMEAELAXLIGQSVWDVSRSS 350 

349 TGNLVEWYLLRKAYERNELAPNKPDEREYERRLRESYAGGYVKEPEKGLW 398 

imiiiiiiii I I . I I I I I | | | | | | ||. MM.. | j l | l I i I f f 1 I 1 

351 TGNLVEVm.LRVAYARNELAPNKPDEEEYKRRLRT7YLGGYVKEPEKGLW 400 

399 EGLVSLDFRSLYPSIIITHNVSPDTLNREGCREYDVAPEVGHKFCKDFPG 448 

' - :: I MM I I III 1:1! I II Ml I ::|l |:: II.:!! Ill I I 

401 ENI I YLD FRS LYP S 1 1 VTHNVS PDTLEKEGCKNYD VAP IVGYRFCKDFPG 450 

449 FIPSLLKRLLDERQEIKRKMKASKDPIEKKMLDYRQRAIKILANS I LP EE 498 

Hll:l I:- 11:11:111.. I I I I II I I I I I I I I I I : I | 1 | | | | : ( 
451 FIPSILGDLIAMRQDIKKKMKSTIDPIEKKMLDYRQRAIKLLANSILPNE 500' 

499 WVPLIKNGKVKIFRIGDFVDGIJ^KANQGKVKKTGDTEVLEVAGIHAFSFD 548 

crt l:l:I.M-:l:-:ll:|:::.|. . . : . I I . . : : I I I I | | . . : ||||: 

501 WLPIIENGEIKFVKIGEFINSYMEKQKENVKTVENTEVLEVNNLFAFSFN 550 
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549 RKSKKARVMAVKAVIRHRYSGNVYRIVLNSGRKITITEGHSLFVYRNGDL 598 

: 1 I . - I . I I I : I I I : I . I . . I I I . M I I I . I I . I II ! I . Ill:: 
551 KKIKESEVKKVKALIRHKYKGKAYE IQLSSGRKINITAGHSLFTVRNGE I 600 

599 VEATGEDVKXGDLLAVPRSVNLPEKRERLNIVELLLNLSPEETEDIILTI 648 

I - . I : : : I I I I : . . I : . : . I II . : 1 1 . I I : : I I . I I I . I I : : 1 I 
601 KEVSGDGIKEGDLIVAPKKIKLNEKGVSINIPELISDLSEEETADIVMTI 650 

649 PVKGRKNFFKGMLRTLRWIFGEE.KRVRTASRYLRHLENLGYIRLRKIGY 697 

. I I I I I I I I I M I 1 I I 1 : I I It : I : I I .III 111*11.1:1 . It 
651 SAKGRKNFFKGMLRTLRWMFGEENRRIRTFNRYLFHLEKLGLIKLLPRGY 700 

698 D I IDKEGLEKYRTLYEKLVD VVRYNGNKRE YLVEFNAVRD VI S LMP EEEL 747 

::.!.! I . 11 : Mill.: 1:1111111111 II . : : 1 . I I . : I : . I I 
701 EVTDWERLKKYKQLYEKLAGSVKYNGNKREYLVMFNEIKDFISYFPQKEL 750 

748 KEWRIGTRNGFRMGTFVDIDEDFAKLLGYYVSEGSARKWKNQTGGWSYTV 797 

.11:111 MM I I I I : I I I I I I I I I I I . I I . I M I I . I 

751 EEWKIGTLNGFRTNCILKVDEDFGKLLGYYVSEGYAGAQKNKTGGISYSV 800 

798 RLYNENDEVLDDMEHLAKKFFGKVKRGKNYVEIPKKMAYI IFESLCGTLA 847 

: I I I I : - : II : . I . : : I . M II if: : : I : I . I . I M II : : : . : I I I . I I 
801 KLYNEDPNVLESMKNVAEKFFGKVRVDRNCVSISXKMAYLVMKCLCGALA 850 

848 ENKRVPEVIFTSSKGVRWAFLEGYFIGDGDVHPSKRVRLSTKSELLVNGL 897 

I M I : I M I : M . . . II I . I II : I I . I M I : I I I I I . II I M I I II . I M 
851 ENKRIPSVILTSPEPVRWSFLEAYFTGDGDIHPSKRFRLSTKSELLANQL 900 

898 VLLLNSLGVSAIKLGYDSGVYRVYVNEELKFTEYRKKKNVYHSHIVPKDI 947 

I : II I I II : 1 . : I : I : I II I II II : I I : I . I . : . : . I I . I . I : : : M : I 
901 VFLLNSLGISSVKIGFDSGVYRVY INEDLQFPQTSREKNTYYSNLIPKEI 950 

LKETFGKVFQKNISYKKFRELVENGKLDREKAKRIEWLLNGDIVLDRWE 9 97 
I : : ■ I I I I I II : . : I II : M I : . I I I : I I II I : I : : : M M II I II- . 
LRD VFGKEFQKNMTFKKFKELVDSGK LNREKAKLLEFF INGD I VLDRVKS 1000 

9 98 IKREYYDGYVYDLSVDEDENFL 1019 
: I . I : I II I I I II ::: I II I 
1001 VKEKDYEGYVYDLSVEDNENFL 1022 
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